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Preface 


This is the first volume of a two-volume textbook’ which evolved from a 
course (Mathematics 160) offered at the California Institute of Technology 
during the last 25 years. It provides an introduction to analytic number 
theory suitable for undergraduates with some background in advanced 
calculus, but with no previous knowledge of number theory. Actually, a 
great deal of the book requires no calculus at all and could profitably be 
studied by sophisticated high school students. 

Number theory is such a vast and rich field that a one-year course cannot 
do justice to all its parts. The choice of topics included here is intended to 
provide some variety and some depth. Problems which have fascinated 
generations of professional and amateur mathematicians are discussed 
together with some of the techniques for sc!ving them. 

One of the goals of this course has been to nurture the intrinsic interest 
that many young mathematics students seem to have in number theory and 
to open some doors for them to the current periodical literature. It has been 
gratifying to note that many of the students who have taken this course 
during the past 25 years have become professional mathematicians, and some 
have made notable contributions of their own to number theory. To all of 
them this book is dedicated. 


1 The second volume is scheduled to appear in the Springer-Verlag Series Graduate Texts in 
Mathematics under the title Modular Functions and Dirichlet Series in Number Theory. 
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Historical Introduction 


The theory of numbers is that branch of mathematics which deals with 
properties of the whole numbers, 


4233, 455.08 


also called the counting numbers, or positive integers. — 

The positive integers are undoubtedly man's first mathematical creation. 
It is hardly possible to imagine human beings without the ability to count, 
at least within a limited range. Historical record shows that as early as 
5700 Bc the ancient Sumerians kept a calendar, so they must have developed 
some form of arithmetic. 

By 2500 Bc the Sumerians had developed a number system using 60 as a 
base. This was passed on to the Babylonians, who became highly skilled 
calculators. Babylonian clay tablets containing elaborate mathematical 
tables have been found, dating back to 2000 Bc. 

When ancient civilizations reached a level which provided leisure time 
to ponder about things, some people began to speculate about the nature and 
properties of numbers. This curiosity developed into a sort of number- 
mysticism or numerology, and even today numbers such as 3, 7, 11, and 13 
are considered omens of good or bad luck. 

Numbers were used for keeping records and for commercial transactions 
for over 5000 years before anyone thought of studying numbers themselves 
in a systematic way. The first scientific approach to the study of integers, 
that is, the true origin of the theory of numbers, is generally attributed to the 
Greeks. Around 600 Bc Pythagoras and his disciples made rather thorough 
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studies of the integers. They were the first to classify integers in various ways: 


Even numbers: 2, 4, 6, 8, 10, 12, 14, 16, ... 

Odd numbers: 1, 3, 5, 7,9, 11, 13, 15,... 

Prime numbers: 2, 3, 5,7, 11, 13, 17, 19, 23, 29, 31, 37, 41, 43, 47, 53, 59, 61, 
67, 71, 73, 79, 83, 89, 97, ... 

Composite numbers: 4, 6, 8, 9, 10, 12, 14, 15, 16, 18, 20, ... 


A prime number is a number greater than 1 whose only divisors are 1 and 
the number itself. Numbers that are not prime are called composite, except 
that the number 1 is considered neither prime nor composite. 

The Pythagoreans also linked numbers with geometry. They introduced 
the idea of polygonal numbers: triangular numbers, square numbers, pen- 
tagonal numbers, etc. The reason for this geometrical nomenclature is 


clear when the numbers are represented by dots arranged in the form of 
triangles, squares, pentagons, etc., as shown in Figure I.1. 


Triangular: ` 
AAA 

1 3 6 10 15 21 28 
Square: can 521 

H Rl SERE 

1 4 9 16 25 36 49 


Pentagonal: E 
e. G Q 
: 2 1a 22 35 Sa iege 


Figure 1.1 


Another link with geometry came from the famous Theorem of Pythagoras 
which states that in any right triangle the square of the length of the hy- 
potenuse is the sum of the squares of the lengths of the two legs (see Figure I.2). 
The Pythagoreans were interested in right triangles whose sides are integers, 
as in Figure 1.3. Such triangles are now called Pythagorean triangles. The 


corresponding triple of numbers (x, y, z) representing the lengths of the sides 
is called a Pythagorean triple. 
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z 
y x+y =z? 


x 


Figure I.2 


A Babylonian tablet has been found, dating from about 1700 BC, which 
contains an extensive list of Pythagorean triples, some of the numbers being 
quite large. The Pythagoreans were the first to give a method for determining 
infinitely many triples. In modern notation it can be described as follows: 
Let n be any odd number greater than 1, and let 


x-n y=? 1, z=% +1). 


The resulting triple (x, y, z) will always be a Pythagorean triple with z = y 
+ 1. Here are some examples: 


» 3"5 q $9 u'a 


y 4 12 24 40 60 84 112 144 180 


2 “se 13 25 41 "6l 85-113: 4145! 181 
There are other Pythagorean triples besides these; for example: 


x 8 12 16 20 


y 15 35 63 99 


z 17 37 65 101 


In these examples we have z = y + 2. Plato (430-349 Bc) found a method for 
determining all these triples; in modern notation they are given by the 
formulas 
xcán yer — 1, 5-2 =4n? 41 
Around 300 Bc an important event occurred in the history of mathematics. 
The appearance of Euclid's Elements, a collection of 13 books, transformed 


mathematics from numerology into a deductive science. Euclid was the 
first to present mathematical facts along with rigorous proofs of these facts. 


5 
? ATUS ‘hea en 


4 12 
Figure 1.3 
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Three of the thirteen books were devoted to the theory of numbers (Books VII, 
IX, and X). In Book IX Euclid proved that there are infinitely many primes. 
His proof is still taught in the classroom today. In Book X he gave a method 
for obtaining all Pythagorean triples although he gave no proof that his 
method did, indeed, give them all. The method can be summarized by the 
formulas 


x= tqa? — b’), y=2ab z= t(a? + b?), 


where t, a, and b, are arbitrary positive integers such that a > b, a and b have 
no prime factors in common, and one of a or b is odd, the other even. 

Euclid also made an imporiant contribution to another problem posed 
by the Pythagoreans—that of finding all perfect numbers. The number 6 
was called a perfect number because 6 = 1 + 2 + 3, the sum ofall its proper 
divisors (that is, the sum of all divisors less than 6). Another example of a 
perfect number is 28 because 28 1 2-4 4 7 + 14, and 1, 2, 4, 7, and 
14 are the divisors of 28 less than 28. The Greeks referred to the proper 
divisors of a number as its “parts.” They called 6 and 28 perfect numbers 
because in each case the number is equal to the sum of all its parts. 

In Book IX, Euclid found all even perfect numbers. He proved that an 
even number is perfect if it has the form 


Ft. 


where both p and 2? — 1 are primes. 
Two thousand years later, Euler proved the converse of Euclid’s theorem. 


That is, every even perfect number must be of Euclid’s type. For example, for 
6 and 28 we have 


60431002 dye5-3 and 28 = 2?7!(23 — 1) = 4.7, 
The first five even perfect numbers are 


6, 28, 496, 8128 and 33,550,336. 


Perfect numbers are very rare indeed. At the present time (1975) only 24 


perfect numbers are known. They correspond to the following values of p. 
in Euclid's formula: es ON 


2, 3, 5, 7, 13, 17, 19, 31, 61, 89, 107, 127, 521, 607, 1279, 2203, 2281, 
3217, 4253, 4423, 9689, 9941, 11,213, 19,937. 


Numbers of the form 2? — 1, where p is prime, are now called Mersenne 
numbers and are denoted by M, in hónor of Mersenne, who studied them in 
1644. It is known that M, is prime for the 24 primes listed above and com- 
posite for all other values of p < 257, excépt possibly for 


p = 157, 167, 193, 199, 227, 229; 


for these it isnot yet known whether M, is prime or composite. 
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No odd perfect numbers are known; it is not.even known if any exist. 
But if any do exist they must be very large; in fact, greater than 10°° (see 
Hagis [29]). 

We turn now to a brief description of the history of the theory of numbers 
since Euclid’s time. 

After Euclid in 300 Bc no significant advances were made in number 
theory until about aD 250 when another Greek mathematician, Diophantus 
of Alexandria, published 13 books, six of which have been preserved. This 
was the first Greek work to make systematic use of algebraic symbols. 
Although his algebraic notation seems awkward by present-day standards, 
Diophantus was able to solve certain algebraic equations involving two or 
three unknowns. Many of his problems originated from number theory and it 
was natural for him to seek integer solutions of equations. Equations to be 
solved with integer values of the unknowns are now called Diophantine 
equations, and the study of such equations is known as Diophantine analysis. 
The equation x? + y? = z? for Pythagorean triples is an example of a 
Diophantine equation. 

After Diophantus, not much progress was made in the theory of numbers 
until the seventeenth century, although there is some evidence that the 
subject began to flourish in the Far East—especially in India—in the period 
between AD 500 and ap 1200. 

In the seventeenth century the subject was revived in Western Europe, 
largely through the efforts of a remarkable French mathematician, Pierre de 
Fermat (1601-1665), who is generally acknowledged to be the father of 
modern number theory. Fermat derived much of his inspiration from the 
works of Diophantus. He was the first to discover really deep properties of 
the integers. For example, Fermat proved the following surprising theorems: 
^ Every. integer is either a triangular number or a sum of 2 or 3 triangul 
numbers; évery integer is either a square or a sum of 2, 3, or 4 iieri dioe 
integer is either a pentagonal number or the sum of 2, 3, 4, or 5 pentagonal 
numbers, and so on. 

Fermat also discovered that every prime number of the form 4n 4- 1 
such as 5, 13, 17, 29, 37, 41, etc., is a sum of two squares. For example, 


52142, 12243, 17-744, 2922.45. 
3721^'--60,  41-4?4- 52, 


Shortly after Fermat's time, the names of Euler (1707-1783), Lagrange 
(1736-1813), Legendre (1752-1833), Gauss (1777-1855), and Dirichlet 
(1805-1859) became prominent in the further development of the subject. 
The first textbook in number theory was published by Legendre in 1798. 
Three years later Gauss published Disquisitiones Arithmeticae, a book which 
transformed the subject into a systematic and beautiful science. Although he 
made a wealth of contributions to other branches of mathematics, as well 
as to other sciences, Gauss himself considered his book on number theory 
to be his greatest work. 
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In the last hundred years or so since Gauss’s time there has been an 
intensive development of the subject in many different directions. It would be 
impossible to give in a few pages a fair cross-section of the types of problems 
that are studied in the theory of numbers. The field is vast and some parts 
require a profound knowledge of higher mathematics. Nevertheless, there 
are many problems in number theory which are very easy to state. Some of 
these deal with prime numbers, and we devote the rest of this introduction 
to such problems. 

The primes less than 100 have been listed above. A table listing all primes 
less than 10 million was published in 1914 by an American mathematician, 
D. N. Lehmer [43]. There are exactly 664,579 primes less than 10 million, 
or about 6}%. More recently D. H. Lehmer (the son of D. N. Lehmer) 
calculated the total number of primes less than 10 billion; there are exactly 
455,052,512 such primes, or about 4197, although all these primes are not 
known individually (see Lehmer [41]). 

A close examination of a table of primes reveals that they are distributed 
in a very irregular fashion. The tables show long gaps between primes. For 
example, the prime 370,261 is followed by 111 composite numbers. There are 
no primes between 20,831,323 and 20,831,533. It is easy to prove that arbitrar- 
ily large gaps between prime numbers must eventually occur. 

On the other hand, the tables indicate that consecutive primes, such as 
3 and 5, or 101 and 103, keep recurring. Such pairs of primes which differ 
only by 2 are known as twin primes. There are over 1000 such pairs below 
100,000 and over 8000 below 1,000,000. The largest pair known to date 
(see Williams and Zarnke [76]) is 76 - 3!?? — 1 and 76-3'3° + 1. Many 
mathematicians think there are infinitely many such pairs, but no one has 
been able to prove this as yet. 

One of the reasons for this irregularity in distribution of primes is that no 


simple formula exists for producing all the primes. Some formulas do yield 
many primes. For example, the expression 


x?!— x41 


gives a prime for x = 0, 1, 2,..., 40, whereas 
x? — 79x + 1601 


gives a prime for x = 0, 1, 2,..., 79. However, no such simple formula can 
give a prime for all x, even if cubes and higher powers are used. In fact, in 
1752 Goldbach proved that no polynomial in x with integer coefficients can 
be prime for all x, or even for all sufficiently large x. 

Some polynomials represent infinitely many primes. For example, as 
x runs through the integers 0, 1, 2, 3,..., the linear polynomial 


2x +1 
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gives all the odd numbers hence infinitely many primes. Also, each of the 
polynomials 
4x 1 and 4x 43 


represents infinitely many primes. In a famous memoir [15] published in 
1837, Dirichlet proved that, if a and b are positive integers with no prime 
factor in common, the polynomial 


ax +b 


gives infinitely many primes as x runs through all the positive integers. 
This result is now known as Dirichlet's theorem on the existence of primes 
in a given arithmetical progression. 

To prove this theorem, Dirichlet went outside the realm of integers and 
introduced tools of analysis such as limits and continuity. By so doing he 
laid the foundations for a new branch of mathematics called analytic number 
theory, in which ideas and methods of real and complex analysis are brought 
to bear on problems about the integers. 

It is not known if there is any quadratic polynomial ax? + bx + c with 
a #0 which represents infinitely many primes. However, Dirichlet [16] 
used his powerful analytic methods to prove that, if a, 2b, and c have no 
prime factor in common, the quadratic polynomial in two variables 


ax? + 2bxy + cy? 


represents infinitely many primes as x and yrun through the positive integers. 

Fermat thought that the formula 2?" + 1 would always give a prime for 
n — 0, 1, 2,... These numbers are called Fermat numbers and are denoted 
by F,. The first five are 


Fo=3, Fi=5, F,=1], F,=257 and F, = 65,537, 


and they are all primes. However, in 1732 Euler found that F5 is composite; 
in fact, 
F, = 2? + 1 = (641)(6,700,417). 


These numbers are also of interest in plane geometry. Gauss proved that if 
F, is a prime, say Fn = P, then a regular polygon of p sides can be con- 
structed with straightedge and compass. 

Beyond F;, no further Fermat primes have been found. In fact, for 5 « 
n € 16 each Fermat number F, is composite. Also, F, is known to be com- 
posite for the following further isolated values of n: 


n = 18,19,21,23,25, 26, 27, 30, 32, 36, 38, 39, 42, 52, 55, 58, 63, 73, 77, 
81, 117, 125, 144, 150, 207, 226, 228, 260, 267, 268, 284, 316, 452, 
and 1945. 
The greatest known Fermat composite, F 1945, has more than 105*? digits, a 


number larger than the number of letters in the Los Angeles and New York 
telephone directories combined (see Robinson [59] and Wrathall [77]). 
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It was mentioned earlier that there is no simple formula that gives all the 
primes. In this connection, we should mention a result discovered in 1947 
by an American mathematician, W. H. Mills [50]. He proved that there is 
some number A, greater than 1 but not an integer, such that 


[4?] is prime for all x = 1,2, 3,... 


Here [47] means the greatest integer < 4?'. Unfortunately, no one knows 
what A is equal to. n 

The foregoing results illustrate the irregularity of the distribution of the 
prime numbers. However, by examining large blocks of primes one finds 
that their average distribution seems to be quite regular. Although there is 
no end to the primes, they become more widely spaced, on the average, as 
we go further and further in the table. The question of the diminishing 
frequency of primes was the subject of much s 


peculation in the early nine- 
teenth century. To study this distribution, we consider a function, denoted 


by x(x), which counts the number of primes <x. Thus, 
n(x) = the number of primes p satisfying 2 < p <x. 
Here is a brief table of this functio. 


n and its comparison with x/log x, where 
log x is the natural logarithm of x. 


x 
x n(x) x/log x ef z 
10 4 4.3 0.93 
10? 25 21.7 1.15 
10? 168 144.9 1.16 
10* 1,229 1,086 1.11 
105 9,592 8,686 1.10 
10° 78,498 72,464 1.08 
107 664,579 621,118 1.07 
108 5,761,455 5,434,780 1.06 
10° 50,847,534 48,309,180 1.05 


10:9 455,052,512 434,294,482 1.048 


By examining a table like this for x < 105, Gauss 
proposed independently that for large x the ratio 


se) | = x 


was nearly 1 and they conjectured that this ratio would approach 1 as x 
approaches oo. Both Gauss and Legendre attempted to prove this statement 
but did not succeed. The problem of deciding the truth or falsehood of this 


[24] and Legendre [40] 
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conjecture attracted the attention of eminent mathematicians for nearly 
100 years. 

In 1851 the Russian mathematician Chebyshev [9] made an important 
step forward by proving that if the ratio did tend to a limit, then this limit 
must be 1. However he was unable to prove that the ratio does tend to a 
limit. 

In 1859 Riemann [58] attacked the problem with analytic methods, using 
a formula discovered by Euler in 1737 which relates the prime numbers to 
the function 


: w= Y 


for real s > 1. Riemann considered complex values of s and outlined an 
ingenious method for connecting the distribution of primes to properties 
of the function ¢(s). The mathematics needed to justify all the details of his 
method had not been fully developed and Riemann was unable to com- 
pletely settle the problem before his death in 1866. 

Thirty years later the necessary analytic tools were at hand and in 1896 
J. Hadamard [28] and C. J. de la Vallée Poussin [71] independently and 
almost simultaneously succeeded in proving that 


lim n(x)log x i 


xi ux 


1. 


This remarkable result is called the prime number theorem, and its proof was 
one of the crowning achievements of analytic number theory. 

In 1949, two contemporary mathematicians, Atle Selberg [62] and Paul 
Erdés [19] caused a sensation in the mathematical world when they dis- 
covered an elementary proof of the prime number theorem. Their proof, 
though very intricate, makes no use of ¢(s) nor of complex function theory 
and in principle is accessible to anyone familiar with elementary calculus. 

One of the most famous problems concerning prime numbers is the 
so-called Goldbach conjecture. In 1742, Goldbach [26] wrote to Euler 
suggesting that every even number >4 is a sum of two primes. For example 


422-42, 6=3 + 3, 8-345, 
1023472545, 122547. 


This conjecture is undecided to this day, although in recent years some 
progress has been made to indicate that it is probably true. Now why do 
mathematicians think it is probably true if they haven't been able to prove it? 
First of all, the conjecture has been verified by actual computation for all 
even numbers less than 33 x 105. It has been found that every even number 
greater than 6 and less than 33 x 10° is, in fact, not only the sum of two odd 
primes but the sum of two distinct odd primes (see Shen [66]). But in number 
theory verification of a few thousand cases is not enough evidence to con- 
vince mathematicians that something is probably true. For example, all the 
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odd primes fall into two categories, those of the form 4n + land those of the 
form 4n + 3. Let z,(x) denote all the primes <x that are of the form 4n +d, 
and let 3(x) denote the number that are of the form 4n + 3. It is known that 
there are infinitely many primes of both types. By computation it was found 
that z,(x) < 73(x) for all x < 26861. But in 1957, J. Leech [39] found that 
for x = 26,861 we have z,(x) = 1473 and T(x) = 1472, so the inequality 


was reversed. In 1914, Littlewood [49] proved that this inequality reverses 


back and forth infinitely often. That is, there are infinitely many x for which 


m(x) < 73(x) and also infinitely many x for which z;(x) < z,(x). Con- 


jectures about prime numbers can be erroneous even if they are verified by 
computation in thousands of cases. 


Therefore, the fact that Goldbach’s conjecture has been verified for all 
even numbers less than 33 x 105 


is only a tiny bit of evidence in its favor. 


N=PitP2+-+--+py (for sufficiently large n). 


If we knew that M were €qual to 2 for all even n, this would prove Goldbach's 


conjecture for all sufficiently large n. In 1956 the Chinese mathematician 
Yin Wen-Lin [78] proved that M < 18. That is 


Progress made on this problem in nearly 200 years. 

A much closer approach to a solution of Goldbach's problem was made 
in 1937 by another Russian mathematician, I. M. Vinogradov [73], who 
proved that from some point on every odd number is the sum of three primes: 


n= p, + p + p, (n odd, n sufficiently large). 
In fact, this is true for all odd n greater than 335 
this is the strongest piece of evidence in favor o 
one thing, it is easy to prove that Vinogradov’s 
Goldbach’s statement. That is, if Goldbach’s ji 
easy to deduce Vinogradov's statement. The big achievement of Vinogradov 
was that he was able to prove his result withou 
Unfortunately, no one has been able to work i 


Another piece of evidence in favor of Goldbach's co 
in 1948 by the Hungarian mathematician Rényi [57] wh 
is a number M such that every sufficiently large eve 
written as a prime plus another number which has no 
factors: 


njecture was found 
© proved that there 
n number n can be 
more than M prime 


n=p+A 
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where A has no more than M prime factors (n even, n sufficiently large). 
If we knew that M = 1 then Goldbach’s conjecture would be true for all 
sufficiently large n. In 1965 A. A. Buhstab [6] and A. I. Vinogradov [72] 
proved that M < 3, and in 1966 Chen Jing-run [10] proved that M < 2. 

We conclude this introduction with a brief mention of some outstanding 
unsolved problems concerning prime numbers. 


1. (Goldbach’s problem). Is there an even number >2 which is not the 
sum of two primes? " a 

2. Is there an even number >2 which is not the difference of two primes? 

3. Are there infinitely many twin primes? 

4. Are there infinitely many Mersenne primes, that is, primes of the form 

2? — 1 where p is prime? 

. Are there infinitely many composite Mersenne numbers? 

6. Are there infinitely many Fermat primes, that is, primes of the form 
2?" +41? : 

7. Are there infinitely many composite Fermat numbers? 

8. Are there infinitely many primes ofthe form x? + 1, where x is an integer? 
(It is known that there are infinitely many of the form x? + y?, and of the 
form x? + y? + 1, and of the form x? + y? + z? + 1). 

9. Are there infinitely many primes of the form x? + k, (k given)? 

10. Does there always exist at least one prime between n? and (n + 1)? for 
every integer n > 1? : 

11. Does there always exist at least one prime between n? and n? 4- n for 
every integer n > 1? 

12. Are there infinitely many primes whose digits (in base 10) are all ones? 
(Here are two examples: 11 and 11,111,111,111,111,111,111,111,) 


Iz 


The professional mathematician is attracted to number theory because 
of the way all the weapons of modern mathematics can be brought to bear on 
its problems. As a matter of fact, many important branches of mathematics 
had their origin in number theory. For example, the early attempts to prove 
the prime number theorem stimulated the development of the theory of 
functions of a complex variable, especially the theory of entire functions. 
Attempts to prove that the Diophantine equation x" + y" = z" has no 
nontrivial solution if n 2 3 (Fermat's conjecture) led to the development of 
algebraic number theory, one of the most active areas of modern mathe- 
matical research. Even though Fermat's conjecture is still undecided, this 
seems unimportant by comparison to the vast amount of valuable mathe- 
matics that has been created as a result of work on this conjecture. Another 
example is the theory of partitions which has been an important factor in the 
development of combinatorial analysis and in the study of modular functions 

There are hundreds of unsolved problems in number theory. New 
problems arise more rapidly than the old ones are solved, and many of th 
ge zs ; have paie unsolved for centuries. As the mathematician, 

ierpinski once said, "...the 
p progress of our knowledge of numbers is 
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advanced not only by what we already know about the: 
what we yet do not know about them.” 

Note. Every serious student of number theory should become acquainted 
with Dickson’s three-volume History of the Theory of Numbers [13], and 
LeVeque's six-volume Reviews in Number Theory [45]. Dickson's History 
gives an encyclopedic account of the entire literature of number theory up 
until 1918. LeVeque's volumes reproduce all the reviews in Volumes 1-44 of 
Mathematical Reviews (1940-1972) which bear directly on questions com- 
monly regarded as part of number theory. These two valuable collections 


provide a history of virtually all important discoveries in number theory from 
antiquity until 1972, 


m, but also by realizing 


The Fundamental Theorem of 
Arithmetic 


1.1 Introduction 


This chapter introduces basic concepts of elementary number theory such 
as divisibility, greatest common divisor, and prime and composite numbers. 
The principal results are Theorem 1.2, which establishes the existence of 
the greatest common divisor of any two integers, and Theorem 1.10 (the 
fundamental theorem of arithmetic), which shows that every integer greater 
than 1 can be represented as a product of prime factors in only one way 
(apart from the order of the factors). Many of the proofs make use of the 
following property of integers. 


The principle of induction 7f Q is a set of integers such that 


(a) 1€Q, 
(b) n € Q implies n + 1€ Q, 


then ; 
(c) all integers > 1 belong to Q. 


There are, of course, alternate formulations of this principle. For example, 
in statement (a), the integer 1 can be replaced by any integer k, provided that 
the inequality >1 is replaced by >k in (c). Also, (b) can be replaced by the 
statement 1, 2, 3, ...,n € Q implies (n + 1) € Q. 

We assume that the reader is familiar with this principle and its use in 
proving theorems by induction. We also assume familiarity with the following 
principle, which is logically equivalent to the principle of induction. 


The well-ordering principle If A is a nonempty set of positive integers, then A 
contains a smallest member. 
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Again, this principle has equivalent formulations. For example, “positive 
integers” can be replaced by “integers > k for some k.” 


12 Divisibility , 


Notation In this chapter, small latin letters a, b, c, d, n, etc., denote integers; 
they can be positive, negative, or zero. 
Definition of divisibility We say d divides n a; 
for some c. We also say that n is a mul 
or that d is a factor of n. If d does not 


nd we write d|n whenever n — cd 
tiple of d, that d is a divisor of n, 
divide n we write dyn. 


Divisibility establishes a relation between any two integers with the 
following elementary Properties whose proofs we leave as exercises for the 


reader. (Unless otherwise indicated, the letters a, b, d, m,n in Theorem 1.1 
represent arbitrary integers.) 


Theorem 1.1 Divisibility has the following properties: 


(a) n|n s 
(b) d|n and n|m implies d|m 

(c) d|n and d|m implies d|(an + bm) 
(d) d|n implies ad |an 

(e) ad|an and a 0 implies d|n 

(f) 1|n 
(g) n|O 
(h) 0|n implies n = 0 

(i) d|nandn #0 implies |d| < |n] 
(3) d|n and n|d implies |d| — |n| 
(k) din and d #0 implies (n/d)|n. 


(reflexive property) 
(transitive property) 
(linearity property) 
(multiplication Property) 
(cancellation law) 

(1 divides every integer) 
(every integer divides zero) 
(zero divides only zero) 
(comparison Property) 


Note. If d|n then n/d is called the divisor conjugate to d. 


1.3 Greatest common divisor 


If d divides two integers a and b, then d is called à common divisor of a and b. 
Thus, 1 is a common divisor of every pair of integers a and b. We prove now 
that every pair of integers a and b has a common divisor which can be ex. 
pressed as a linear combination of a and b. 


Theorem 1.2 Given any two integers a and b, there is a common divisor d ofa 
and b of the form 


d= ax + by, 


1.3: Greatest common divisor 


where x and y are integers. Moreover, every common divisor of a and b 
divides this d. 


Proor. First we assume that a > 0 and b > 0. We use induction on n, 
where n = a + b: If n = 0 then a = b = 0 and we can take d = 0 with 
x = y = 0. Assume, then, that the theorem has been proved for 0, 1, 2,..., 
n — 1. By symmetry, we can assume a > b. If b = 0 take d = a, x = 1, 
y = 0. If b > 1 apply the theorem to a — b and b. Since (a — b) + b = 
a-—n-— b € n — 1, the induction assumption is applicable and there is a 
common divisor d of a — b and b of the form d = (a — b)x + by. This d 
also divides (a — b) + b = a so d is a common divisor of a and b and we 
have d = ax + (y — x)b, a linear combination of a and b. To complete the 
proof we need to show that every common divisor divides d. But a common 
divisor divides a and b and hence, by linearity, divides d. 

If a < Oor b < 0 (or both), we can apply the result just proved to |a| and 
|b|. Then there is a common divisor d of |a| and |b| of the form 


d = |a|x + |bly. 


If a < 0, |a|x = —ax = a(— x). Similarly, if b < 0, |b|y = b(— y). Hence d 
is again a linear combination of a and b. Oo 


Theorem 1.3 Given integers a and b, there is one and only one number d with 
the following properties: 


(a) d20 (d is nonnegative) 
(b) d|a and d|b (d is a common divisor of a and b) 
(c) ela and e|b implies e|d (every common divisor divides d). 


Pnoor. By Theorem 1.2 there is at least one d satisfying conditions (b) and (c). 
Also, —d satisfies these conditions. But if d' satisfies (b) and (c), then d|d’ 
and d'|d, so |d| = |4'|. Hence there is exactly one d > 0 satisfying (b) 
and (c). s O 


Note. In Theorem 1.3, d = 0 if, and only if, a = b = 0. Otherwise d > 1. 

Definition The number d of Theorem 1.3 is called the greatest la 
divisor (gcd) of a and b and is denoted by (a, b) or byl aDb. If (a, b) = 
then a and b are said to be relatively prime. a 


The notation aDb arises from interpreting the gcd as an operation per- 
formed on a and b. However, the most common notation in use is (a, b) and 
this is the one we shall adopt, although in the next theorem we also use the 
notation aDb to emphasize the algebraic properties of the operation D. 
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Theorem 1.4 The gcd has the following properties: 
(a) (a, b) = (b, a) 


.  aDb = bDa b (commutative law) 
(b) (a, (b, c)) = ((a, b), c) 

aD(bDc) = (aDb)Dc (associative law) 
(c) (ac, bc) = |c|(a, b) , 

(ca)D(cb) = |c|(aDb) (distributive law) 
(d) (a, 1) = (1, a) = 1, (a, 0) = (0, a) —.la|. 

aDli=1Da=1, aD0=0Da=|al. 


ProoF. We prove only (c). Proofs of the other statements are left as exercises 
for the reader. 


Let d — (a, b) and let e — (ac, bc). We wish to prove that e — |c|d. Write 
d = ax + by. Then we have 


(1) N cd = acx + bey. 
Therefore cd|e because.cd divides both ac and bc. Also, Equation (1) shows 
that e|cd because e|ac and e|bc. Hence |e| = |cd|, or e = |c|d. ud 


Theorem 1.5 Euclid's lemma. If a|bc and if (a, b) = 1, then a|c. 


Pnoor. Since (a, b) = 1 wecan writel = ax + by. Thereforec = acx + bey. 
But a|acx and a|bcy, so a|c. 


1.4 Prime numbers 


Definition An integer n is called prime if n > 1 and if the only positive 
divisors of n are 1 and n: If n > 1 and if n is not prime, then n is called 
composite. 


EXAMPLES The prime numbers less than 100 are 2, 3, 5, 7, 11, 13, 17, 19, 23, 
29, 31, 37, 41, 43, 47, 53, 59, 61, 67, 71, 73, 79, 83, 89, and 97. 


Notation Prime numbers are usually denoted by p, p’, Di» d: d^, di 


LJ i 
Theorem 1.6 Every integer n > 1 is either a prime number or a product of 
prime numbers. 


Proor. We use induction on n. The theorem is clearly true for n = 2. Assume 
it is true for every integer <n. Then if n is not prime it has a positive divisor 
d # 1,d # n. Hencen = cd, where c + n. But both cand dare <nand >1 
so each of c, d is a product of prime numbers, hence so is n. o 


Theorem 1.7 Euclid. There are infinitely many prime numbers. 
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Euc.ip’s PRoor. Suppose there are only a finite number, say p1, 2s - - - » Pn 
Let N = 1 + pipa 777 Pa: Now N >.1 so either N is prime or N isa product 
of primes. Of course N is not prime since it exceeds each p;. Moreover, 
no p; divides N (if p;|N then p; divides the difference N — pı P2 ^^ Ps = 1 

This contradicts Theorem 1.6. o 


Theorem 1.8 If a prime p does not divide a, then (p, a) = 1. 


Proor. Let d = (p, à). Then d|p so d= lor d = p. But dla so d +p 
because p ¥ a. Hence a= I, o 


Theorem 1.9 If a prime p divides ab, then p|a or p|b. More generally, if a 
prime p divides a product a, * * Gy, then p divides at least one of the factors. 


Proor. Assume p ab and that p ¥ a. Weshall prove that p|b. By Theorem 1.8, 
(p, a) = 1 so, by Euclid’s lemma, p|b. 

To prove the more general statement we use induction on n, the number of 
factors. Details are left to the reader. 


1.5 The fundamental theorem of arithmetic 


Theorem 1.10 Fundamental theorem:of arithmetic. Every integer n > 1 can 
be represented as a product of prime factors in only one way, apart from the 
order of the factors. 


Pnoor. We use induction on n. The theorem is true for n — 2. Assume, then, 
that it is true for all integers greater than 1 and less than n. We shall prove 
itis also true for n. If nis prime there is nothing more to prove. Assume, then, 
that n is composite and that n has two factorizations, say - 


Q) n = pip: Ps = dido de 
We wish to show that s — t and that each p equals some q. Since p, divides 


the product 4192 * ** d it must divide at least one factor. Relabel q1, 42» . - -» 4 
so that p, |q;. Then pi = di since both p, and q, are primes. In (2) we may 


cancel p, on both sides to obtain 
n/p, = P2777 Ps = d2777 de 


Ifs>lort>1 then 1 < n/p, < n. The induction hypothesis tells us that 
the two factorizations of n/p, must be identical, apart from the order of the 
factors. Therefore s = t and the factorizations in (2) are also identical, apart | 
from order. This completes the proof. _ e 


Note. In the factorization of an integer n, a particular prime p may occur 
more than once. If the distinct prime factors of n are p,,-.-,P, and if p; 
occurs as a factor a; times, we can write 


n= pit Pe 
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or, more briefly, 


n= I pj". 


This is called the factorization of n into Prime powers. We can also express 1 
in this form by taking each exponent a; to be 0, 
Theorem 1.11 If n = Iri pe. the set of positive divisors of n is the set of 

numbers of the form | Ji- , pi“, where 0 < c, < a; for i = 1,2,. 


Proor. Exercise. 


Sere 


B d 
Note. If we label the primes in increasing order, thus: 
P; = 2, P = 3, P3 = 5,..., Pn = the nth prime, 


every positive integer n (including 1) can be expressed in the form 
n= |p 
i=1 


ponent a; > 0. The positive divisors of n are all numbers of 


where now each ex 
the form 


where 0 < c, < a;. The products are, of course, finite. 
Theorem 1.12 Jf two Positive integers a and b have the factorizations 
o Li 
amm II Pi", b= II pi^, 
i=1 i=1 
then their gcd has the factorization 


(a,b) = [Ipi 
i=1 


< a; and c; < b, we have d|a and d|b sod 
1s à common divisor of a and b. Let e be any 


à common divisor of a and b, and 
write e = [T2 , pj. Then e; < a; and e; < b; so e, < c;. Hence e|d, so d is 
the gcd of a and b. o 


1.6 The series of reciprocals of the primes 


Theorem 1.13 The infinite series S. 


1l/p, diverges, - 
ProoF. The following short 


r Proof of this theorem 
We assume the series cony, 


is due to Clarkson [1 
erges and obtain a co} Ed 


ntradiction, If the series 
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converges there is an integer k such that 


c al 
—« 
m=k+1 Pm 


NI = 


Let Q = p, --: px, and consider the numbers 1 + nQ for n = 1,2,... None 
of these is divisible by any of.the primes p,, ..., px- Therefore, all the prime 
factors of 1 + nQ occur among the primes p, 41, Py+2,--- Therefore for each 


r > 1 we have 
Y 1 o ( LJ 1 t 
—— —], 
n=11+nQ à D a 


* since the sum on the right includes among its terms all the terms on the left. 
But the right-hand side of this inequality is dominated by the convergent 


geometric series 
o 1 t 
PIO] í 


Therefore the series YS 1/(1 + nQ) has bounded partial sums and hence 
converges. But this is a contradiction because the integral test or the limit 
comparison test shows that this series diverges. 


Note. The divergence of the series 5^ 1/p, was first proved in 1737 by 
Euler [20] who noted that it implies Euclid's theorem on the existence of 
infinitely many primes. 


In a later chapter we shall obtain an asymptotic formula which shows 
that the partial sums Y5- , 1/p, tend to infinity like log(log n). 


1.7 The Euclidean algorithm 


Theorem 1.12 provides a practical method for computing the gcd (a, b) when 
the prime-power factorizations of a and b are known. However, considerable 
calculation may be required to obtain these prime-power factorizations and 
it is desirable to have an alternative procedure that requires less computa- 
tion. There is a useful process, known as Euclid's algorithm, which does not 
require the factorizations of a and b. This process is based on successive 
divisions and makes use of the following theorem. 


Theorem 1.14 The division algorithm. Given integers a and b with b > 0, there 
exists a unique pair of integers q and r such that 
a=bq+r, with0 <r « b. 


Moreover, r = 0 if, and only if, b|a. 


Note. We say that q is the quotient and r the remainder obtained when b 
is divided into a. 


19 


1: The fundamental theorem of arithmetic 


Pnoor. Let S be the set of nonnegative integers given by 
S = {y:y =a — bx, x is an integer, y > 0}. 


This is a nonempty set of nonnegative integers so it has a smallest member, 
say a — bq. Let r = a — bq. Then a = bq + r and r > 0. Now we show that 
r<b. Assume r >b. Then 0<r—b<r. But r—beS since r— b = 
a — b(q + 1). Hencer — bisa member of S smaller than its smallest member, 
r. This contradiction shows that r < b. The pair q, r is unique, for if there were 
another such pair, say q', r', then bg +r =hq' +r’ so b(q — q) = r — r. 
Hence b|(r' — r). If r' — r #0 this implies b € |r — r'|, a contradiction. 
Therefore r' = r and q' = q. Finally, it is clear that r = 0 if, and only if, 
b|a. , o 


Note. Although Theorem 1.14 is an existence theorem, its proof actually 
gives us a method for computing the quotient q 2nd the remainder r. We 
subtract from a (or add to a) enough multiples ‘of b until it is clear that we have 
obtained the smallest nonnegative number of the form a — bx. 


Theorem 1.15 The Euciidean algorithm. Given positive integers a and b, where 
b ¥ a. Let ro = a, r, = b, and apply the division algorithm repeatedly to 
obtain a sei of remainders r;, rs, . . . , r,, r, 4, defined successively by the 


relations 
To — ridi t r2, 0<r <r; 
T= To E fas *O<r3 <r, 
Tn-2 = Fn-iQg-i + Try ORT Taa 


Tn-i = Taan Taris y Tati =O 


Thenr,, the last nonzero remainder in this process, is (a, b), the gcd of aand b. 
Proor. There is a stage at which r,,, = 0 because the r; are decreasing 
and nonnegative. The last relation, r,_; = Tn Qn Shows that r,|r, . 1: The next 
to last shows that r,|r,-2. By induction we see that r, divides each r;. In 
particular r,|"; = b and r,|ro = a, so r, is a common divisor of a and b. 
Now let d be any common divisor of a and b. The definition of 72 Shows that 


d|r;. The next relation shows that d|r3. By induction, d divides each ri SO 
d|r,. Hence r, is the required ged. 


E 
1.8 The greatest common divisor of more 
than two numbers 
The greatest common divisor of three integers a, b, c is denoted by (a, b, c) and 
is defined by the relation 


(a, b, c) = (a, (b, c). 
by D CAE de e Ce 


Exercises for Chapter 1 


By Theorem 1.4(b) we have (a, (b, c)) = (a, b), c) so the gcd depends only on 
a, b, c and not on the order in which they are written. 

Similarly, the gcd of n integers ay, . .., a, is defined inductively by the 
relation 


(a, «+++ Gn) = (a1, (a2, - - -> Gn))- 
Again, this number is independent of the order in which the a; appear. 
If d = (a,,..., An) it is easy to verify that d divides each of the a; and that 


every common divisor divides d. Moreover, d is a linear combination of the 
a;. That is, there exist integers x;, .--, X, such that " 


(a, ...,a,) = 3X1. + + A, Xy- 


Ifd = 1 the numbers are said to be relatively prime. For example, 2, 3, and 10 
are relatively prime. 

If(a;, aj) = 1 whenever i # j the numbers a, ..., a, are said to be relatively 
prime in pairs. If a, . . . , a, are relatively prime in pairs then (a;, ..., a,) = 1. 
However, the example (2, 3, 10) shows that the converse is not necessarily 
true. 


Exercises for Chapter 1 


In these exercises lower case latin letters a, b, c, ..., x, y, z represent integers. 
Prove each of the statements in Exercises 1 through 6. 


If (a, b) = 1 and if cla and dļb, then (c, d) = 1. 

If (a, b) = (a, c) = 1, then (a, bc) = 1. 

If (a, b) = 1, then (a^, b^) = 1 for all n > 1, k > 1. 

If (a, b) = 1, then (a + b, a — b) is either 1 or 2. 

If (a, b) = 1, then (a + b, a? — ab + b?) is either 1 or 3. 
If (a, b) = L and if d|(a + b), then (a, d) = (b, d) = 1. 


SERA e y ge qp 


A rational number a/b with (a, b) = 1 is called a reduced fraction. If the sum of two 
reduced fractions is an integer, say (a/b) + (c/d) = n, prove that |b| = |d|. 


8. An integer is called squarefree if it is not divisible by the square of any prime. Prove 
that for every n > 1 there exist uniquely determined a > 0 and b > 0 such that 
n = a?b, where b is squarefree. 


9. For each of the following statements, either give a proof or exhibit a counter example. 
(a) If b?|n and a?|n and a? < 6’, then a|b. 
(b) If b? is the largest square divisor of n, then a*|n implies a|b. 


10. Given x and y, let m = ax + by, n = cx + dy, where ad — bc = +1. Prove that 
(m, n) — (x, y). 


11. Prove that n* + 4 is composite if n > 1. 


QL. 1K 
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1: The fundamental theorem of arithmetic 


In Exercises 12, 13, and 14, a, b, c, m, n denote positive integers. 


12. 


13. 


14. 


15. 
16. 


18. 


21. 


23. 


24. 


22 


For each of the following statements either give a proof or exhibit a counter example. 


(a) If a"|b" then a|b. | 
(b) If n"|m™ then n|m. | 
(c) If.a"|2b" and n > 1, then a|b. 


If (a, b) — 1 and (a/b)" — n, prove that b — 1. 
(b) If n is not the mth power of a positive integer, prove that n!/" is irrational. 


If (a, b) = tand ab = c", prove that a = x" and b = y" for some x and y. [Hint: 
Consider d — (a, c).] 


Prove that every n > 12 is the sum of two composite numbers. 


Prove that if 2" — 1 is prime, then n is prime. 


- Prove that if 2" 4- 1 is prime, then n is a power of 2. 


> 
Ifm # n compute the gcd (a?” + 1, à?" + 1) in terms of a. [Hint: Let A, =a" +1 
and show that A,|(A,, — 2) ifm > n] 


. The Fibonacci sequence 1, 1,2, 3, 5,8, 13,21, 34, ... is defined by the recursion formula 


an+1 = An + 4,1, With a, = a; = 1. Prove that (a,, a,4) = 1 for each n. 


Let d = (826, 1890). Use the Euclidean al 


gorithm to compute d, then express d as a 
linear combination of 826 and 1890. 


The least common multiple (Icm) of two integers a and b is denoted by [a, b] or by 
aMb, and is defined as follows: 


[a, b] = |ab|/(a, b) ifa 0 and b 0, 
[2,5] 20 ifa-0orb- 0. 


Prove that the lcm has the following properties:- 
(a) Ifa = [] #2, p; andb = Iz 


21 p then [a, b] = [Tz., p wh = i bj}. 
(b) (aDb)Mc = (aMc)D(bMc). e) 7 [fap where ey = maxta, b) 
(c) (aMb)De = (aDc)M(bDc). 


e 


(D and M are distributive with respect to each other) 


. Prove that (a, b) = (a + b, [a, b]). M 


The sum of two positive inte; 


: gers is 5264 and their least common multiple is 200,340. 
Determine the two integers, 


Prove the following multiplicative Property of the ged 


(ah, bk) = (a, b (ratos TU 
ares am (h, k)/A(a, b)’ (h, Ky’ 


In particular this shows that (ah, bk) = (a, k)(b, h) whenever (a,b) = (h k) — 1. 


Exercises for Chapter 1 


Prove each of the statements in Exercises 25 through 28. All integers are 
positive. 


25. If (a, b) = 1 there exist x > 0 and y > 0 such that ax — by = 1. 


26. If (a, b) = 1 and x? = y’ then x = n? and y = n° for some n. [Hint : Use Exercises 25 
and 13.] 


27. (a) If (a, b) = 1 then for every n > ab there exist positive x and y such that n = 
ax + by. 
(b) If (a, b) = 1 there are no positive x and y such that ab = ax + by. 


28. If a > 1 then (a" — 1, a" — 1) = a™™ — 1, 


29. Given n > 0, let S be a set whose elements are positive integers «2n such that if a 
and barein S anda # b then a ¥ b. What is the maximum number of integers that S. 
can contain? [Hint: S can contain at most one of the integers 1, 2, 22, 23,..., at 
most one of 3, 3-2, 3- 27, ..., etc.] 


30. If n > 1 prove that the sum - 


is not an integer. 
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Arithmetical Functions and 
Dirichlet Multiplication 


2.1 Introduction 


Number theory, like many other branches of mathematics, is often concerned 
with sequences of real or complex numbers. In number theory such sequences 
are called arithmetical functions. 


Definition A real- or complex-valued function defined on the positive 
integers is called an arithmetical function or a number-theoretic function. 


This chapter introduces several arithmetical functions which play an 
important role in the study of divisibility properties of integers and the 
distribution of primes. The chapter also discusses Dirichlet multiplication, 
a concept which helps clarify interrelationships between various arith- 
metical functions. 

We begin with two important examples, the Móbius function p(n) and 
the Euler totient function q(n). 


22 The Möbius function p(n) 
Definition The Möbius function j is defined as follows: 


A1) = 1; 
If n > 1, write n = p,^ --- p,*. Then 
p(n) = (71 ifa, — a; =--- — a, = 1, 


p(n) = 0 otherwise. 


Note that p(n) = 0 if and only if n has a square factor > 1. 
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23: The Euler totient function y(n) 


Here is a short table of values of p(n): 


pin): 1 —1 -1 0 -1 1 -1 0 0 il 
The Móbius function arises in many different places in number theory. 
One of its fundamental properties is a remarkably simple formula for the 
divisor sum J ay 4(d), extended over the positive divisors of n. In this formula, 
[x] denotes the greatest integer <x. 


Theorem 2.1 /f n > 1 we have 


Mn] BA fis, 
z«-[]-5 ifn > 1. 


Proor. The formula is clearly true if n = 1. Assume, then, that n > 1 and 
writen = p;^' -:- p,*.In the sum Yan H(@) the only nonzero terms come from 
d — 1 and from those divisors of n which are products of distinct primes. 
Thus 


Z Hd) = p(1) + u(pi) + +++ + ipi) + (pipa) + +++ + H(PR—1Pxd) 


+++ pipa P) 


za «(ensem «(Qe - o -y-e 0 


2.3 The Euler totient function (7) 


Definition If n > 1 the Euler totient p(n) is defined to be the number of 
positive integers not exceeding n which are relatively prime to n; thus, 


W et)- Y'1, 
k=1 


where the ' indicates that the sum is extended over those k relatively 
prime to n. 


Here is a short table,of values of p(n): 


n: 1 2 3 4 5 6 7 8 9 10 
en): 1 1 2 2 4 2 6 4 6 4 


As in the case of y(n) there is a simple formula for the divisor sum J'an g(d). 
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2: Arithmetical functions and Dirichlet multiplication 


Theorem 2.2 Ifn > 1 we have 
Y old) = n. 
din 


Pnoor. Let S denote the set (,2,..., n). We distribute the integers of S into 
disjoint sets as follows. For each divisor d of n, let 


A(d) = (k:(k, n =d,1<k xn) 


That is, A(d) contains those elements of S which have the gcd d with n. 


The sets A(d) form a disjoint collection whose union is S. Therefore if AC) 
denotes the number of integers in A(d) we have 


(2) Z f(a) =n. 


But (k, n) = d if and only if (k/d, n/d) 
0 < k/d < n/d. Therefore, if we let q= 
dence between the elements in A(d)and t 


(4, n/d) = 1. The number of such q is 
becomes 


— l, and 0<k<n if and only if 
k/d, there is a one-to-one correspon- 
hose integers q satisfying 0 < q<n/d, 
9(n/d). Hence f(d) = g(n/d) and (2) 


Zold) =n. 


But this is equivalent to the statement 5^4, e(d) = n because when d runs 
through all divisors of n so does n/d. This completes the proof. uu 


2.4 A relation connecting o and y 


The Euler totient is relate 


d to the Móbius function through the following 
formula: 


Theorem 23 Ifn > | we have 


9(n) = Y, u(d) a 
d|n 


Pnoor. The sum (1) defining (n) can be rewritten in the form 
9(n) = Y rm : 
xci LG, k) 


where now k runs through all integers <n. Ni 


i Ow we use Theorem 2.1 with n 
replaced by (n, k) to obtain 


on=}_} ud)- Y Y ua) 
k=1 d|(n, k) k=1 dE 
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2.5: A product formula for o(n) 


For a fixed divisor d of n we must sum over all thosekintherangel <k € n 
which are multiples of d. If we write k = gd then 1 < k € n if and only if 
1 < q < n/d. Hence the last sum for ọ(n) can be written as 


njà njd n 
qn) = Y, ¥ ud) = Y. ud) 3.1 = Y. 5- 
din q=1 din q71 d|n 


This proves the theorem. o 


2.5 A product formula for q(n) 


The sum for q(n) in Theorem 2.3 can also be expressed as a product extended 
over the distinct prime divisors of n. 


Theorem 2.4 For n > 1 we have 


(3) g(n) 2 n TT ( - 2) 


Proor. For n = 1 the product is empty since there are no primes which 
divide 1. In this case it is understood that the product is to be assigned the 
value 1. 

Suppose, then, that n > 1 and let pı, .. ., p, be the distinct prime divisors 
of n. The product can be written as 


De 


ae les Star it 


Pi PiPj PiPjPr P1P2°** Pr à 


On the right, in a term such as Y^ 1/p;p;p; it is understood that we consider 
all possible products p;pjPx of distinct prime factors of n taken three at a 
time. Note that each term on the right of (4) is of the form +1/d where d 
is a divisor of n which is either 1 or a product of distinct primes. The numera- 
tor +1 is exactly p(d). Since ju(d) = 0 if d is divisible by the square of any p; 
we see that the sum in (4) is exactly the same as 


id) 
dad 


This proves the theorem. r1] 


Many properties of p(n) can be easily deduced from this product formula. 
Some of these are listed in the next theorem. 
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2: Arithmetical functions and Dirichlet multiplication 


Theorem 2.5 Euler’s totient has the following properties: 


(a) e(p*) = p* — p*~* for prime p and « > 1. 

(b) (mn) = (m)o(n)(d/o(4)), where d = (m, n). 

(c) e(mn) = e(m)o(n) if (m, n) = 1. 

(d) a|b implies q(a)| (b). 

(e) (n) is even for n > 3. Moreover, if n has r distinct odd prime factors, 
then 2" | (n). 


Pnoor. Part (a) follows at once by taking n — p* in (3). To prove part (b) 


we write 
1 
em) n z 3) 
n pin P 


Next we note that each prime divisor of mn is either a prime divisor of m 
or of n, and those primes which divide both m and n also divide (m, n). Hence 


1 1\ o(m) o(n) 

e. pp n 7307) See 

mn plmn P = *) ola) ; 
plim, n) p, d 


for which we get (b). Part (c) is a special case of (b). 
Next we deduce (d) from (b). Since a|b we have b = ac where 1 < c < b. 


If c= b then a = 1 and part (d) is trivially satisfied. Therefore, assume 
c < b. From (b) we have 


OL glc) 
ed - 19 o 


(5) olb) = glac) = e(a)o(c 
where d = (a, c). Now the result follows by induction on b. For b = 1 it 
holds trivially. Suppose, then, that (d) holds for all integers <b. Then it 
holds for c so q(d)| (c) since d|c. Hence the right member of (5) is a multiple 
of g(a) which means q(a)| o(b). This proves (d). 

Now we prove (e). If n = 2%, « > 2, part (a) shows that g(n) is even. If n 
has at least one odd prime factor we write 


pil. n 1 _ 
gn) “il n Iie ) ot) [1 1), 


where c(n) is an integer. The product multiplying c(n) is even so q(n) is even. 


Moreover, each odd prime p contributes a factor 2 to this product, so 2" | o(n) 
if n has r distinct odd prime factors. 
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2.6: The Dirichlet product of arithmetical functions 


2.6 The Dirichlet product of arithmetical 
functions 


In Theorem 2.3 we proved that 
n 
oln) = Y. wld) =. 
ae d 


The sum on the right is of a type that occurs frequently in number theory. 


These sums have the form 
n 
2 rax) 
d|n 


where f and g are arithmetical functions, and it is worthwhile to study some 
properties which these sums have in common. We shall find later that sums 
of this type arise naturally in the theory of Dirichlet series. It is fruitful to 
treat these sums as a new kind of multiplication of arithmetical functions, 
a point of view introduced by E. T. Bell [4] in 1915. 


Definition If fand g are two arithmetical functions we define their Dirichlet 
product (or Dirichlet convolution) to be the arithmetical function A 
defined by the equation 


h(n) = Y. rax). 
djn 


Notation We write f » g for h and (f * g) (n) for h(n). The symbol N will be 
used for the arithmetical function for which N(n) = n for all n. In this nota- 
tion, Theorem 2.3 can be stated in the form 


9-—uxN. 


The next theorem describes algebraic properties of Dirichlet multi- 
plication. 


Theorem 2.6 Dirichlet multiplication is commutative and associative. That is, 
for any arithmetical functions f, g, k we have 
f*g-g*f (commutative law) 
(f*g*k-—f*(g*k) (associative law). 


Pnoor. First we note that the definition of f * g can also be expressed as 
follows: 


(f * gn = 2 F(a)g(b), 


where a and b vary over all positive integers whose product is n. This makes 
the commutative property self-evident. 
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2: Arithmetical functions and Dirichlet multiplication 


To prove the associative property we let A = g + k and consider f « A = 
f *(g * k). We have 


(*4)- Y f(4)- Y. fla Y, gb) 


= À fawoxo. 


In the same way, if we let B = f * g and consider B * k we are led to the same 
formula for (B * k)(n). Hence f + A = B « k which means that Dirichlet 
multiplication is associative. 


We now introduce an identity element for this multiplication. 


Definition The arithmetical function J given by 


1 1 ifn =1, 
Tine H > i: ifn » 1, 
is called the identity function. 


Theorem 2.7 For all f we have I « f = f » I =f 
Proor. We have 


( * Di) => no) => LH = fin) 
d|n din n 
since [d/n] = O if d < n. 


2.7 Dirichlet inverses and the Móbius 
inversion formula 


Theorem 2.8 Jf f is an arithmetical function with SQ) # 0 there is a unique 
arithmetical function f ^ ! , called the Dirichlet inverse of f, such that 


Sf =f f= 
Moreover, f ^! is given by the recursion formulas 
l =A n 
"dM e ihm. s n|.., 
f W-;p O j rr (a) forn> 1. 


(1) am 
d<n 


Proor. Given f, we shall show that the equation (f n = In) 
unique solution for the function values f~ 
equation 


hasa 
!(n) Forn = 1 we have to solve the 


(f * F750) = r0) 
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2.7: Dirichlet inverses and the Möbius inversion formula 


which reduces to 
SOTO = 1. 


Since f (1) # 0 there is one and only one solution, namely f~*(1) = 1/f (1). 
Assume now that the function values f ^ !(k) have been uniquely determined 
for all k < n. Then we have to solve the equation (f * f ^ !)(n) = I(n), or 


2 (Q)- (a) = 0. 
din 
This can be written as 


SDS+ Y (rw =0. 
pa 


If the values f ^ !(d) are known for all divisors d < n, there is a uniquely 
determined value for f ^ ! (n), namely, 


d ES 
176-45 zo 
d<n 


since f(1) # 0. This establishes the existence and uniqueness of f ^! by 
induction. mj 


Note. We have (f * g)(1) = f(1)g(1). Hence, if f (1) # 0 and g(1) # 0 then 
(f *g)(1) # 0. This fact, along with Theorems 2.6, 2.7, and 2.8, tells us that, 
in the language of group theory, the set of all arithmetical functions f with 
f(1) # 0 forms an abelian group with respect to the operation +, the identity 
element being the function J. The reader can easily verify that 


(f*g)!-f^'*g^! if f(1) ¥ Oand g(1) 4 0. 


Definition We define the unit function u to be the arithmetical function such 
that u(n) = 1 for all n. 


Theorem 2.1 states that 3^4, u(d) = I(n). In the notation of Dirichlet 
multiplication this becomes 


Lu*uc-l. 
Thus u and p are Dirichlet inverses of each other: 
uci and yeni 
This simple property of the Móbius function, along with the associative 


property of Dirichlet multiplication, enables us to give a simple proof of the 
next theorem. 
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2: Arithmetical functions and Dirichlet multiplication 


Theorem 2.9 Möbius inversion formula. The equation 


(6) f(- 2.9 

implies 

(U) an) = Y ray. 
d|n 


Conversely, (7) implies (6). 
Poor. Equation (6) states that f = g + u. Multiplication by p gives f * # = 
(g * u) + u = g + (u + u) = g + I = g, which is (7). Conversely, multiplication 
of f * u = g by u gives (6). 


The Mobius inversion formula has already been illustrated by the pair of 
formulas in Theorems 2.2 and 2.3: 


n=}, g) = » a. 


2.8 The Mangoldt function A(n) 


We introduce next Mangoldt's function A which plays a central role in the 
distribution of primes. 


Definition For every integer n > 1 we define 


logp ifn = p" for some prime p and some m > 1, 
A(n) = : 
0 otherwise. 


Here is a short table of values of A(n): 


n: 112 3 4 5. (69 T 8 9 10 
A(n): 0 log2 log3 log2 log5 0 log7 log2 log3 0 


The proof of the next theorem shows how this function arises naturally 
from the fundamental theorem of arithmetic. 


Theorem 2.10 Jf n > 1 we have 
(8) log n = ); A(d). 


din 


Proor. The theorem is true if n = 1 since both members are 0. Therefore, 
assume that n > 1 and write 


n= [In 
k=1 
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29: Multiplicative functions 


Taking logarithms we have 
logn = > a, log pr- 
k=1 


Now consider the sum on the right of (8). The only nonzero terms in the sum 
come from those divisors d of the form p," for m = 1, 2,...,a, and k = 
1, 2, ...,r. Hence 


r a r a r 
vi A(d) — F p» A(p,") = X Y log p, — »» a, log p, — log n, 
an k=1 m=1 k=1 m=1 k=1 
which proves (8). o 
Now we use Möbius inversion to express A(n) in terms of the logarithm. 


Theorem 2.11 Zf n > 1 we have 
n 
A(n) = È u(d)og a Y. u(d)log d. 
d|n djn 


Pnoor. Inverting (8) by the Möbius inversion formula we obtain 


A(n) = » u(d)log : = logn Y ud) — x u(d)log d 


= I(n)log n — Y; u(d)log d. 
ain 


Since I(n)log n = 0 for all n the proof is complete. o 


2.9 Multiplicative functions 


We have already noted that the set of all arithmetical functions f with 
f(1) #0 forms an abelian group under Dirichlet multiplication. In this 
section we discuss an important subgroup of this group, the so-called multi- 
plicative functions. 


Definition An arithmetical function f is called multiplicative if f is not 
identically zero and if 
f(mn) = f(m)f(n) whenever (m, n) = 1. 


A multiplicative function f is called completely multiplicative if we also 
have 


f(mn) = f(m)f(n) for all m, n. 


EXAMPLE | Let f,(m) = n^, where « is a fixed real or complex number. This 
function is completely multiplicative. In particular, the unit function u — fo 
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2: Arithmetical functions and Dirichlet multiplication 


is completely multiplicative. We denote the function f, by N* and call it the 
power function. 


EXAMPLE 2 The identity function [(m) = [1/n] is completely multiplicative. 


EXAMPLE 3 The Móbius function is multiplicative but not completely multi- 
plicative. This is easily seen from the definition of y(n). Consider two 
relatively prime integers m and n. If either m or n has a prime-square factor 
then so does mn, and both (mn) and p(m)y(n) are zero. If neither has a square 
factor write m = p; --- p, and n = q, ++- q; where the p; and q; are distinct 
primes. Then (m) = (— 1), p(n) = (—1) and u(mn) = (—1)*' = p(m)u(n). 
This shows that 4 is multiplicative. It is not completely multiplicative since 
(4) = 0 but p(2)u(2) = 1. 


EXAMPLE 4 The Euler totient (n) is multiplicative. This is part (c) of Theorem 
2.5. It is not completely multiplicative since (4) = 2 whereas ¢(2)p(2) = 1. 


EXAMPLE 5 The ordinary product fg of two arithmetical functions f and g 
is defined by the usual formula 


(f9)(n) = f(n)g(n). 
Similarly, the quotient f/g is defined by the formula 


(Sm = x whenever g(n) 4 0. 


If f and g are multiplicative, so are fg and f/g. If f and g are completely 
multiplicative, so are fg and f/g. 


We now derive some properties common to all multiplicative functions. 
Theorem 2.12 If f is multiplicative then f (Y) = 1. 


Proor. We have f(n) = f(1)f(n) since (n, 1) = 1 for all n. Since f is not 
identically zero we have f (n) # 0 for some n, so f (1) = 1. 


Note. Since A(1) — 0, the Mangoldt function is not multiplicative. 
Theorem 2.13 Given f with f(1) = 1. Then: 
(a) f is multiplicative if, and only if, 
So” ++ p) = f(p.?) -- f(p,") 


for all primes p; and all integers a; > 1. 
(b) Iff is multiplicative, then f is completely multiplicative if, and only if, 


Jo) = for 
for all primes p and all integers a > 1. 
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2.10: Multiplicative functions and Dirichlet multiplication 


Proor. The proof follows easily from the definitions and is left as an exercise 
for the reader. 


2.10 Multiplicative functions and Dirichlet 
multiplication 


Theorem 2.14 If f and g are multiplicative, so is their Dirichlet product f * g. 


Proor. Let h = f * g and choose relatively prime integers m and n. Then 


mn 
h(mn) = Y f ox). 
c|mn 

Now every divisor c of mn can be expressed in the form c — ab where a|m 
and b|n. Moreover, (a, b) — 1, (m/a, n/b) — 1, and there is a one-to-one 
correspondence between the set of products ab and the divisors c of mn. 


Hence 
hmn) = reba) 23 sorod ha) 
bln bln 
-Xf (7) X nox) = h(m)h(n). 
ajm aj bin 
This completes the proof. : o 


Warning. The Dirichlet product of two comp!etely multiplicative functions 
need not be completely multiplicative. 


A slight modification of the foregoing proof enables us to prove: 
Theorem 2.15 /f both g and f * g are multiplicative, then f is also multiplicative. 


Proor. We shall assume that f is not multiplicative and deduce that f» g is 
also not multiplicative. Let h = f *g. Since f is not multiplicative there 
exist positive integers m and n with (m, n) = 1 such that 


f(mn) # f (m) f (n). 


We choose such a pair m and n for which the product mn is as small as possible. 
If mn — 1 then f(1)  f(1/(l) so f(1) 1. Since h(1) = f(1)g(1) = 
f (1) # 1, this shows that h is not multiplicative. 
If mn > 1, then we have f (ab) = f(a) f (b) for all positive integers a and b 
with (a, b) = 1 and ab < mn. Now we argue as in the proof of Theorem 2.14, 
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2: Arithmetical functions and Dirichlet multiplication 


except that in the sum defining h(mn) we separate the term corresponding to 
a = m, b =n. We then have 


hn) = reb) + Som) = Y nore 2 + mn) 
ae FL 
-Y fax?) » sodz) — f(m)f (n) + f (mn) 
alm n 


= h(m)h(n) — f(m) f(n) + f(mn). 


Since f (mn) # f(m)f(n) this shows that h(mn) # h(m)h(n) so h is not multi- 
plicative. This contradiction completes the proof. m 


Theorem 2.16 Jf g is multiplicative, so is g^ ! , its Dirichlet inverse. 
Proor. This follows at once from Theorem 2.15 since both g and g «g^! = I 
are multiplicative. (See Exercise 2.34 for an alternate proof.) o 


Note. Theorems 2.14 and 2.16 together show that the set of multiplicative 


functions is a subgroup of the group of all arithmetical functions f with 
f) # 0. 


2.11 The inverse of a completely 
multiplicative function 


The Dirichlet inverse of a completely multiplicative function is especially 
easy to determine. 


Theorem 2.17 Let f be multiplicative. Then f is completely multiplicative if, 
and only if, 


f'n) = un) fn) for alln > 1. 
Pnoor. Let g(n) = u(n)f (n). If f is completely multiplicative we have 
(g* f)(n) = x aasas(§) = f(n) » (d) = f(n)I(n) = I(n) 
since f(1) = 1 and I(n) = 0 for n > 1. Hence g = f~}. 
Conversely, assume f~ t(n) = y(n) f(n). To show that f is completely 


multiplicative it suffices to prove that f (p°) = f(p) for prime powers. The 
equation f ^ '(n) = y(n) f (n) implies that 


xar?) =0  foraln 1. 
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2.12: Liouville’s function A(n) 


Hence, taking n = p* we have 
HO) S (0) GF + PSPS) = 0, 
from which we find f(p^) = f (p) (p^ ^). This implies f (p^) = f(p)*, so f 


is 
completely multiplicative. o 


EXAMPLE The inverse of Euler’s o function. Since ọ = u * N we have 
Q!-pu!*N -1 But N^! = uN since N is completely multiplicative, so 


9 !-pu'*ygN-usgN. 
Thus 


`n) = 144). 
The next theorem shows that 

o`n) = Ia - p. 
Theorem 2.18 Jf f is multiplicative we have 

2 Hid) f(a) = II (1 — fp). 

Proor. Let 

g(n) = duds (a). 
Then g is multiplicative, so to determine g(n) it suffices to compute g(p*). But 

g(p^) = P (d) = (1) 1) + n(pf(p) = 1 — f). 

Hence 


gn) = IIo? - [Ia - fP). a 


pin 


2.12 Liouville's function A(n) 


An important example of a completely multiplicative function is Liouville’s 
function 4, which is defined as follows. 


Definition We define A(1) = 1, and if n = pi - - - p;* we define 
An) = (1*7 t9. 
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The definition shows at once that 2 is completely multiplicative. The next 
theorem describes the divisor sum of /. 


Theorem 2.19 For every n > 1 we have 


zw- if n is a square, 


ain 0 otherwise. 
Also, à~ *(n) = |u(n)] for all n. 


Proor. Let g(n) = Y'4,4(d). Then g is multiplicative, so to determine 
g(n) we need only compute 9(p^) for prime powers. We have 


g(p") = 220 = 1 + Ap) + Ap?) + --- + Alp’) 


0 ifais odd, 


Sh os ee 
if ery {i if a is even. 


Hence if n = [Ji p? we have g(n) = [Ti-: 9(p2. If any exponent a; is 
odd then g(p,) = Oso g(n) = 0. Ifall the exponents a; are even then g(p;") = 1 
for all i and g(n) = 1. This shows that g(n) = 1 if n is a square, and g(n) = 0 
otherwise. Also, 47 !(n) = u(n)A(n) = p(n) = |u(n)]. O 


2.13 The divisor functions c. (n) 


: Definition For real or complex g and any integer n 7 1 we define 
adn) = » d', 
djn 
the sum of the ath powers of the divisors of n. 


The functions c, so defined are called divisor 
plicative because c, = u * N*, the Dirichlet pro 
functions 
When a = 0, a(n) is the number of divisors of n; this is often denoted by 
d(n). 

When « = 1, c,(n) is the sum of the divisors of n; this is often denoted by 
o(n). 


Since c, is multiplicative we have 


functions. They are multi- 
duct of two multiplicative 


o{p," «++ p") = api”) -+ o(p"). 
To compute c;(p*) we note that the divisors of a prime power p^ are 
L Dip? eps 
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hence 
pts — 1 . 
cp) = 1* + p p^ er pM = c^ ifa #0 
=at+l ifa = 0. 


The Dirichlet inverse of c, can also be expressed as a linear combination 
of the ath powers of the divisors of n. 


Theorem 2.20 For n > 1 we have 
e, (n) =>, exay(.. 
d|n 


Pnoor. Since c, = N*» u and N*is completely multiplicative we have 
6, ! = (uN*) «u^! = (uN?) * p. a 


2.14 Generalized convolutions 


Throughout this section F denotes a real or complex-valued function 
defined on the positive real axis (0, + co) such that F(x) = 0 for0<x « I. 


Sums of the type 
y ane (=) 
nsx n 


arise frequently in number theory. Here « is any arithmetical function. 
The sum defines a new function G on (0, +00) which also vanishes for 
0 <x < 1. We denote this function G by « » F. Thus, 


@>oF\(x)= Y, «ur(S) 


nsx 
If F(x) = 0 for all nonintegral x, the restriction of F to the integers is an 
arithmetical function.and we find that 
(a ° F)(m) = (x * F)(m) 


for all integers m > 1, so the operation » can be regarded as a generalization 
of the Dirichlet convolution *. 

The operation » is, in general, neither commutative nor associative. 
However, the following theorenrserves as a useful substitute for the associa- 
tive law. 


Theorem 2.21 Associative property relating » and «. For any arithmetical 
functions « and B we have 


(9) a c (Bo F) 2X * P) ° F. 
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Proor. For x > 0 we have 


{a0 (B°F)}(x)= Xa) Y pome( =) = PEL 


nsx msx/n 
k x x 
= (zy e ow) 
= {(@ * B) e F}{x). 
This completes the proof. o 


Next we note that the identity function I(n) = [1/n] for Dirichlet convolu- 
tion is also a left identity for the operation o. That is, we have 


(le F\(x) = Y FO = F(x). 


Now we use this fact alon, 


g with the associative property to prove the follow- 
ing inversion formula. 


Theorem 2.22 Generalized inversion formula. If « has a Dirichlet inverse a^! 
then the equation 


(10) Gx) = » ane(*) 
nsx n 
implies 
(11) FX) = Y we(*) 
nsx n 


Conversely, (11) implies (10). 
Proor. If G = ao F then 
a heG=a-to(aoF)=(a-!*o)oF =]oF =F. 
Thus (10) implies (11). The converse is similarly proved. o 


The following special case is of particular importance. 


Theorem 2.23 Generalized Möbius inversion formula. If a is completely 
multiplicative we have 


aw) = X ae) if, and only if, F(x) = Y; names). 


nsx 
Pnoor. In this case ^! (n) = u(n)a(n). o 
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2.15 Formal power series 


In calculus an infinite series of the form 


eo 
(12) Y; a(n)x" = a(0) + a(1)x + a(2)x? + --- + a(n)x" +++ 

n=0 
is called a power series in x. Both x and the coefficients a(n) are real or 
complex numbers. To each power series there corresponds a radius of 
convergence r >0 such that the series converges absolutely if |x| « r 
and diverges if |x| > r. (The radius r can be +.) 

In this section we consider power series from a different point of view. 
We call them formal power series to distinguish them from the ordinary 
power series of calculus. In the theory of formal power series x is never 
assigned a numerical value, and questions of convergence or divergence are 
not of interest. 

The object of interest is the sequence of coefficients 


(13) (a(0), a(1), ..., a(n), ...). 


All that we do with formal power series could also be done by treating the 
sequence of coefficients as though it were an infinite-dimensional vector with 
components a(0), a(1),... But for our purposes it is more convenient to 
display the terms as coefficients of a power series as in (12) rather than as 
components of a vector as in (13). The symbol x" is simply a device for 
locating the position of the nth coefficient a(n). The coefficient a(0) is called 
the constant coefficient of the series. 

We operate on formal power series algebraically as though they were 
convergent power series. If A(x) and B(x) are two formal power series, say 

E 


A(x) = ¥ a(n)x" and B(x) = b(n)x”, 
n=0 


n=0 


we define: 


Equality: A(x) = B(x) means that a(n) = b(n) for all n = 0. 
Sum: A(x) + B(x) = Lato (aln) + b(n)x". 
Product: A(x)B(x) = Yo-o c()x", where 


(14) c(n) = Y, alk)b(n — k). 
k=0 
The sequence {c(n)} determined by (14) is called the Cauchy product 


of the sequences {a(n)} and {b(n)}: . : 
The reader can easily verify that these two operations satisfy the commuta- 


tive and associative laws, and that multiplication is distributive with respect 
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to addition. In the language of modern algebra, formal power series form a 
ring. This ring has a zero element for addition which we denote by 0, 
0 = Y a(n)x", where a(n) = 0 for all n > 0, 
n=0 
and an identity element for multiplication which we denote by 1, 


1= J a(n)x", where a(0) = 1 and a(n) = 0 for n > 1. 


n=0 
A formal power series is called a formal polynomial if all its coefficients 
are 0 from some point on. 


For each formal power series A(x) = Y 7. o a(n)x" with constant coefficient 
a(0) # Othereisa uniquely determined formal power series B(x) = Y. b(n)x" 
such that A(x)B(x) = 1. Its coefficients can be determined by solving the 
infinite system of equations 

a(0)b(0) = 1 
a(0)b(1) + a(1)b(0) = 0, 
a(0)b(2) + a(1)b(1) + a(2)b(0) = 0, 


in succession for b(0), b(1), (2), ... The series B(x) is called the inverse of A(x) 
and is denoted by A(x)~! or by 1/A(x). 
The special series 


A(x) = 1+ Y a"x^ 
n=1 


is called a geometric series. Here a is 


: g ' an arbitrary real or complex number. 
Its inverse is the formal polynomial 


B(x) = 1 — ax. 
In other words, we have 
j| LJ] 
iam ls PE 


2.16 The Bell series of an arithmetical 
function 


E. T. Bell used formal power series to Study properties of multiplicative 
arithmetical functions. 


Definition Given an arithmetical function f and a prime p, we denote by 
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Spx) the formal power series 
fe) = Y sor 
and call this the Bell series of f modulo p. 
Bell series are especially useful when f is multiplicative. 


Theorem 2.24 Uniqueness theorem. Let f and g be multiplicative functions. 
Then f = g if, and only if, 


FAX) = g(x) for all primes p. 


Proor. If f = g then f(p") = g(p") for all p and all n > 0, so f,(x) = g(x). 
Conversely, if f(x) = g,(x) for all p then f(p") = g(p") for all n > 0. Since f 
and g are multiplicative and agree at all prime powers they agree at all the 
positive integers, so f — g. oO 


It is easy to determine the Bel! series for some of the multiplicative 
functions introduced earlier in this chapter. 


EXAMPLE 1 Möbius function jt. Since (p) = —1 and wp") = 0 for n > 2 
we have 


ux) = 1-— x. 


EXAMPLE 2 Euler's totient g. Since g(p") = p" — p"^! for n > 1 we have 


"— — eee ee 
n=1 n=0 n=0 


c Em 
e (L3 ats ica 


EXAMPLE 3 Completely multiplicative functions. If fis completely multiplica- 


tive then f(p") = f(p)" for all n > 0 so the Bell series f,(x) is a geometric 
series, 


E 1 
ff) = Y fer = TF 
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In particular we have the following Bell series for the identity function 7, 
the unit function u, the power function N*, and Liouville's function 4: 


Ijx) = 1. 


il 
1- x 


u,(x) = Pa = 


Nt zl myn = A 
p(x) F Lr E px 


A) = 2 1px" = DI 


2.17 Bell series and Dirichlet multiplication 


The next theorem relates multiplication of Bell series to Dirichlet multi- 
plication. 


Theorem 2.25 For any two arithmetical functions f and g let h = f x» g. Then 
for every prime p we have 


hx) = fol(x)g (x). 
Proor. Since the divisors of p” are 1, p, p?, . . . , p" we have 
Wo) = Y ra) = rone 
dip" k=0 


This completes the proof because the last sum is the Cauchy product of the 
sequences ( f(p")} and {g(p")}. (mj 


EXAMPLE ! Since u?(n) = 4^ }(n) the Bell series of 1? modulo p is 


1 
2 = = 
wlx) = i9) =14+x. 


EXAMPLE 2 Since c, = N** u the Bell series of c, modulo p is 


1 D 1 
-px1i-x 1-aepx-pxi 


(o,),(x) = NG(x)u,(x) = 1 


EXAMPLE 3 This example illustrates how Bell series can be used to discover 
identities involving arithmetical functions. Let 


f(n) = 2, 
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where v(1) = Oand v(n) = kifn = py! «++ p™. Then fis multiplicative and its 
Bell series modulo p is 


fal t+ YW alt Yawrat+ eek Ba. 
n=1 n=1 


1-x 1=—x 
Hence 
Syl) = Hp (ux) 
which implies f = 4? * u, or 
2*9 = F p(d). 
d|n 
2.18 Derivatives of arithmetical functions 


Definition For any arithmetical function f we define its derivative f" to be 
the arithmetical function given by the equation 


f'n) = f(mlogn forn > 1. 
EXAMPLES Since I(n)logn = 0 for all n we have J’ = 0. Since u(n) = 1 


for all n we have u’(n) = log n. Hence, the formula Ja A(d) = log n can be 
written as 


(15) A*uc-u. 
This concept of derivative shares many of the properties of the ordinary 
derivative discussed in elementary calculus. For example, the usual rules for 


differentiating sums and products also hold if the products are Dirichlet 
products. 


Theorem 2.26 If f and g are arithmetical functions we have: 
(à) (f +g) = f' +9 
(b (fxg) = f'*g + f *g- 
(c) (f^) = —f' * (f * f) provided that f(1) + 0. 


Proor. The proof of (a) is immediate. Of course, it is understood that 
f + is the function for which (f + g)(n) = f(n) + g(n) for all n. 
To prove (b) we use the identity log n = log d + log(n/d) to write 


(f «afe - Y; slde( 3) 
d|n 
n n n 
= Ls (dlog a) $ » f (S jos) 
=(f'«g)(n) + (f * g)(. 
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To prove (c) we apply part (b) to the formula I’ = 0, remembering that 
I= f » f~*. This gives us 


O=( ef Y=fefitt fap ty 
so 
fof = fx f 
Multiplication by f ^! now gives us 
Uy = fs fu fo fie (fot ef) 
But f^! « f^! = (f * f)! so (c) is proved. Oo 
2.19 The Selberg identity 
Using the concept of derivative we can quickly derive a formula of Selberg 


which is sometimes used as the starting point of an elementary proof of the 
prime number theorem. 


Theorem 2.27 The Selberg identity. For n > 1 we have 
A()log n + Y AW@A(*) = F p(d)log?”. 
d|n d d|n d 
Proor. Equation (15) states that A + u = w’, Differentiation of this equation 


gives us 


A'*u-c A*u =u" 
or, since u’ = A * u, 


A*u + Ax (A*u) =u". 
Now we multiply both sides by » = u^! to obtain 
N+AeA=u"«p. 


This is the required identity. 


Exercises for Chapter 2 
1. Find all integers n such that 
(a) oln) = n/2, (b) gn) = en), (c) gn) = 12. 
2. For each of the following statements either give a proof or exhibit a counter example. 


(a) If (m, n) = 1 then (g(m), g(n)) = 1. 
(b) If 1 is composite, then (n, (n) E. 
(c) If the same primes divide m and n, then ng(m) = mg(n). 
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3. Prove that 
fw yEO 
oln) dm o(d) 
4. Prove that ọ(n) > n/6 for all n with at most 8 distinct prime factors. 


5. Define v(1) = 0, and for n > 1 let v(n) be the number of distinct prime factors of n. 
Let f = p * v and prove that f (n) is either 0 or 1. 


6. Prove that 
Pld) = w(n) 


d?|n 


and, more generally, 


Yu) = 


t if m*|n for some m > 1, 
dijn 


otherwise. 


The last sum is extended over all positive divisors d of n whose kth power also 
divide n. 


7. Let j(p, d) denote the value of the Möbius function at the gcd of p and d. Prove that 
for every prime p we have 


1 ‘ifm=1, 
Y udu(p.d)-42 ifn=p5a>1, 
din À 

0 otherwise. 


8. Prove that 
Y. u(d)log" d = 0 


d|n 
ifm > 1 and n has more than m distinct prime factors. [Hint: Induction.] 


9. If x is real, x > 1, let g(x, n) denote the number of positive integers <x that are 
relatively prime to n. [Note that o(n, n) = q(n).] Prove that 


qx, n) = DE ] and PLE T a) = [x]. 


din 


In Exercises 10, 11, and 12, d(n) denotes the number of positive divisors of n. 

10. Prove that [Jip t = nP. 

11. Prove that d(n) is odd if, and only if, n is a square. 

12. Prove that Y, d)? = (Dein d(t). 

13. Product form of the Möbius inversion formula. If f (n) > 0 for all n and if a(n) is real, 
a(1) # 0, prove that i 


gln) = [] f(a" if, and only if, f0) = TT g(a", 


d|n din 


where b = a^ t, the Dirichlet inverse of a. 
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14. Let f(x) be defined for all rational x in 0 < x < 1 and let 


m Ll n d 
r= Ss) r= i0) 
(k,n)7 1 


(a) Prove that F* = p + F, the Dirichlet product of p and F. 


(b) Use (a) or some other means to prove that p(n) is the sum of the primitive nth 
roots of unity: 


u(n) = y erin, 
k=1 
(k,m-1 
15. Let ¢,(n) denote the sum of the kth powers of the numbers <n and relatively prime 
to n. Note that @o(n) = (n). Use Exercise 14 or some other means to prove that 
ed _ 14+- +n 
bg a 


16. Invert the formula in Exercise 15 to obtain, for n > 1, 


1 1 n 
gilt) -2ne(n, and ~—g,(n) =-= non) + =] ] (1 — p). 
2 3 6 bin 
Derive a corresponding formula for q(n). 


17. Jordan's totient J, is a generalization of Euler's totient defined by 


Jin) = nk I (1 =p. 
pin 
(a) Prove that 


k 
In) =F wl *) and nt =) J,(d). 
din d|n 


(b) Determine the Bell series for J, es 
18. Prove that every number of the form 2°- ‘(2° — 1) is perfect if 2" — 1 is prime. 
19. Prove that if n is even and perfect then n — 2*7!" — 1) for some a > 2. It is not 
known if any odd perfect numbers exist. It is known that there are no odd perfect 
numbers with less than 7 prime factors. 


20. Let P(n) be the product of the positive integers which are «n and relatively prime 
to n. Prove that 


d iate) 
P(n) = n* [T (5) 


din 


21. Let f(n) — L/n] — [Vn — 1]. Prove that f is multiplicative but not completely 
multiplicative. 


22. Prove that 


a(n) => e (o) 


din 
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and derive a generalization involving o,(n). (More than one generalization is 
possible.) 

23. Prove the following statement or exhibit a counter example. If f is multiplicative, 
then F(n) = [Jan / (4) is multiplicative. 


24. Let A(x) and B(x) be formal power series. If the product A(x)B(x) is the zero series, 
prove that at least one factor is zero. In other words, the ring of formal power series 


has no zero divisors. 


25. Assume f is multiplicative. Prove that: 


(a) f^ !(n) = a(n) f(n) for every squarefree n. 
(b) f^ (p^) = fP)? — f(p?) for every prime p. 


26. Assume f is multiplicative. Prove that f is completely multiplicative if, and only 
if, f 7 (p^) = 0 for all primes p and all integers a > 2. 


27. (a) If f is completely multiplicative, prove that 
f-@*h)=(f-9)*(f-h) 
for all arithmetical functions g and h, where f : g denotes the ordinary product, 


(f -g)(n) = fmi). 
(b) If f is multiplicative and if the relation in (a) holds for g = p and h = u^ 
prove that f is completely multiplicative. 


28. (a) If f is completely multiplicative, prove that 
SD =S 


for every arithmetical function g with g(1) # 0. 
(b) If f is multiplicative and the relation in (a) holds for g = 4^ +, prove that f is 
completely multiplicative. 


29. Prove that there is a multiplicative arithmetical function g such that 


$ s,m) gradi) 
din 


k=1 


for every arithmetical function f. Here (k, n) is the ged of n and k. Use this identity to 
prove that 


Y, nuk, n) = pr). 
k=1 


30. Let f be multiplicative and let g be any arithmetical function. Assume that 


@ fe") =fOF0")— g(p)f(p"~*) for all primes p and all n > 1. 


Prove that for each prime p the Bell series for f has the form 


1 
^ {= 1— fasce ape" 


Conversely, prove that (b) implies (a). 
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31. (Continuation of Exercise 30.) If g is completely multiplicative prove that statement 
(a) of Exercise 30 implies 


f(mft)- y, aos) 
d|(m, n) d 


where the sum is extended over the Positive divisors of the gcd (m, n). [Hint : Consider 
first the case m = p°, n = p^] 


32. Prove that 
n 
e.m)e.n) - Y, aro") 


d\(m,n) 


33. Prove that Liouville’s function is given by the formula 


n 
wo- Zz) 


34. This exercise describes an alternate proof of Theorem 2.16 which states that the 
Dirichlet inverse of a multiplicative function is multiplicative. Assume g is multi- 
plicative and let f = g~!. 


(a) Prove that if p is prime then for k > 1 we have 
k 
S= — Yaris. 
t=) 


(b) Let h be the uniquely determined multiplicative function which agrees with f 
at the prime powers. Show that h « g agrees with the identity function J at the 
prime powers and deduce that h + g = I. This shows that f = h so f is multi- 
plicative. 


35. If f and g are multiplicative and if a and b are positive integers with a > b, prove 


that the function h given by 
nY fn 
h(n) = ~= l-z 
lee (ay (5) 


is also multiplicative. The sum is extended over those divisors d of n for which d^ 
divides n. 


MOBIUS FUNCTIONS OF ORDER k. 

If k 2 1 we define Hr, the Möbius function of order k, as follows: 
al) = 1, 
Hn) = Oif p**'|n for some prime p, 


An) =(—1N ifn = pt-pt] pt, <a <k, 


ta(n) = 1 otherwise. 


In other words, H,(n) vanishes if n is divisible by the (k + 1)st power of some 
prime; otherwise, (n) is 1 unless the prime factorization of n contains the 


50 » 


Exercises for Chapter 2 


kth powers of exactly r distinct primes, in which case u(n) = (— 1)’. Note that 
Hı = p, the usual Möbius function. 
Prove the properties of the functions y, described in the following exercises. 


36. If k > 1 then p,(n*) = p(n). 


37. Each function y, is multiplicative. 


38. If k > 2 we have 
n n 
nun) = L^ (5). () 


Iun) = 3 Y Hd). 


EET 


39. If k > 1 we have 


40. For each prime p the Bell series for 4, is given by 
1 — 2xk  x**! 
1I-x i 


(Hy) (0) = 
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Arithmetical Functions 


3.1 Introduction 


The last chapter discussed various identities satisfied by arithmetical func- 
tions such as ų(n), p(n), A(n), and the divisor functions c,(n). We now inquire 
about the behavior of these and other arithmetical functions f(n) for large 
values of n. 

For example, consider d(n), the number of divisors of n. This function takes 
on the value 2 infinitely often (when n is prime) and it also takes on arbitrarily 
large values when n has a large number of divisors. Thus the values of d(n) 
fluctuate considerably as n increases. 

Many arithmetical functions fluctuate in this manner and it is often 
difficult to determine their behavior for large n. Sometimes it is more fruitful 
to study the arithmetic mean 


jo -ly stb. 
nazi 


Averages smooth out fluctuations so it is reasonable to expect that the mean 
values f(n) might behave more regularly than f (n). This is indeed the case 
for the divisor function d(n). We will prove later that the average d(n) grows 
like log n for large n; more precisely, 


(1) : lim do) 


=1. 
aso logn | 


This is described by saying that the average order of d(n) is log n. 
To study the average of an arbitrary function f we need a knowledge 
of its partial sums )%-, f(k). Sometimes it is convenient to replace the 
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upper index n by an arbitrary positive real number x and to consider instead 
sums of the form 


Y fH. 


ksx 


Here it is understood that the index k varies from 1 to [x], the greatest 
integer <x. If0 < x < 1 the sum is empty and we assign it the value 0. Our 
goal is to determine the behavior of this sum as a function of x, especially 


for large x. 
For the divisor function we will prove a result obtained by Dirichlet 


in 1849, which is stronger than (1), namely 
Q) F dik) = x log x + QC — Dx + OC /x) 
ksx 
for all x > 1. Here C is Euler's constant, defined by the equation 
(3) TDI 
The symbol O( /x) represents an unspecified function of x which grows no 


faster than some constant times x. This is an example of the “big oh” 
notation which is defined as follows. 


3.2 The big oh notation. Asymptotic equality 
of functions 
Definition If g(x) > 0 for all x 2 a, we write 
f(x) = O(gC9) (read: “ f(x) is big oh of g(x)") 


to mean that the quotient f! (x)/g(x) is bounded for x 2 4; that is, there 


exists a constant M > 0 such that 
|f691 € Mga) for all x > a. 


An equation of the form 
f(x) = ho) + olg) 


means that f(x) — h(x) = OU). We note that f(t) = O(g()) for t >a 


implies f* f(t) dt = O(fz 9 40 for x 2 a. 
Definition If 

we say that f(x) is asymptotic to g(x) as x — oo, and we write 
f(x) ~ gx) asx o. 
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For example, Equation (2) implies that 


Yd)-xlogx asx- oo. 


ksx 


In Equation (2) the term x log x is called the asymptotic value of the sum; 
the other two terms represent the error made by approximating the sum 
by its asymptotic value. If we denote this error by E(x), then (2) states that 


(4) E(x) = QC — 1)x + O(/). 


This could also be written E(x) = O(x), an equation which is correct but 
which does not convey the more precise information in (4). Equation (4) 
tells us that the asymptotic value of E(x) is (2C — 1)x. 


3.3 Euler’s summation formula 


Sometimes the asymptotic value of a partial sum can be obtained by com- 
paring it with an integral. A summation formula of Euler gives an exact 
expression for the error made in such an approximation. In this formula 
[t] denotes the greatest integer «t. 


Theorem 3.1 Euler's summation formula. Jf f has a continuous derivative f 
on the interval [ y, x], where 0 < y < x, then 


(5) Y s= [roa fe- tarea 
y<nsx y y 


+ SOE] — x) — fo)(01 - ». 


Proor. Let m = [y], k = [x]. For integers n and n — 1 in Ly, x] we have 


f proa - f m- nea =m nt - fa- D 
n- n-1 
= {nf(n) - (n — Df(n — 0) — fin). 


Summing from n = m + 1 to n = k we find 


k k 
faro d= Y inf) -(-10)fn—-1))— Y fm 
n=m+ ysnsx 
=Kfk)-—mfm- Y, fin, 
y<nsx 
hence 
k 

(6) PT (n) = -[ [elf dt + kf(k) — mf (m) 


- [taro ae + Wyte) - mro). 
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Integration by parts gives us 
[rod = sre) - 90) - [roa 
y y 
and when this is combined with (6) we obtain (5). o 


3.4 Some elementary asymptotic formulas 


The next theorem gives a number of asymptotic formulas which are easy 
consequences of Euler’s summation formula. In part (a) the constant C is 
Euler's constant defined in (3). In part (b), ¢(s) denotes the Riemann zeta 
function which is defined by the equation 


a= TL weed 
nci? 


and by the equation 


t(s) = tim (55-7 —) if0<s<1. 


xo \nsx M 


Theorem 3.2 /f x > 1 we have: 


(a) g t= togx ++ oi} 


nsx 


O I eat +00 5 fs>0sA4l. 


() X mec ifs» 1. 


(d) 2e- pt Oe) far. 


Proor. For part (a) we take (t) = 1/t in Euler’s summation formula to 
obtain 

1 wh itd curd Lx] 

t 1 t x 
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The improper integral (? (t — [¢])t~? dt exists since it is dominated by 


J? t7? dt. Also, 
co — oa 
o< f Cas St = 
x t "T x 
so the last equation becomes 
1 ? t — [t] 1 
-=l 1- — =— dt + 0|- |. 
p n io [ rs E 


This proves (a) with 


c-i- l'a. 
1 


Letting x — oo in (a) we find that 


tim X yo ex)=1- [as 
1 


xa \nsx 


so C is also equal to Euler’s constant. 


To prove part (b) we use the same type of argument with f(x) = x^*, 
where s > 0, s # 1. Euler's summation formula gives us 


i. (Pa Fes 
= s-s f: [J 
1 


x- b] 
Ew F pri ee x 


Eu o sul ?t-[r] t 
-IMCputi sf dt + O(x-. 


gt 1 
Therefore 
1 x! ag 
(7) eee 5 
Tost 0900675, 
where 


Di d S EH 
Cs) = 1 Tab par di. 
If s > 1, the left member of (7) approaches 
and x~* both approach 0. Hence C(s) = 
and (7) shows that 


Us) as x — oo and the terms x!~* 
Üs)ifs» 1110-s- 1x20 


4 1 1-s 
(z E-E) as 


Therefore C(s) is also equal to ((s) if 0 < s < 1. This proves (b). 
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To prove (c) we use (b) with s > 1 to obtain 


1 1-s 
Xl-q)-Y2-2--005-0079 


nx nsx" S 


since x^? < x! ^*. 
Finally, to prove (d) we use Euler's summation formula once more with 
f(t) = t* to obtain 


Xr- fr dt +a fee- [t]) dt + 1 — (x — [x])x* 


nsx 1 1 


ati 1 x 
z ik (s Í gi a) + O(x?) 
1 


“atl ati 
al 
= pL ee E 


3.5 The average order of d(n) 


In this section we derive Dirichlet’s asymptotic formula for the partial sums 
of the divisor function d(n). 


Theorem 3.3 For all x > 1 we have 
(8) F din) = x log x + (2C — 1)x  0(/»), 
nsx 


where C is Euler’s constant. 


Proor. Since d(n) = ¥ gin 1 we have 


¥de@)= Yt 

nsx nsx din 
This is a double sum extended over n and d. Since d|n we can write n = qd 
and extend the sum over all pairs of positive integers q, d with qd < x. Thus, 


(9) Yda)= t 
qd 


nsx » 
qdsx 

This can be interpreted as a sum extended over certain lattice points in the 
qd-plane, as suggested by Figure 3.1. (A lattice point is a point with integer 
coordinates.) The lattice points with qd = n lie on a hyperbola, so the sum in 
(9) counts the number of lattice points which lie on the hyperbolas corre- 
sponding to n = 1, 2,..., [x]. For each fixed d < x we can count first those 
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lattice points on the horizontal line segment 1 € q < x/d, and then sum over 
„all d € x. Thus (9) becomes 


(10) X4dp- Y Yi 


nsx dsx q<x/d 
Now we use part (d) of Theorem 3.2 with « = 0 to obtain 
ye : + O(1). 
qsxi/d 
Using this along with Theorem 3.2(a) we find 
x 1 
Ldn) = ¥ E * ow} =x} 5+ O(x) 
nsx x d dzxd 


ds 


1 
= fos x+C+4+ 9 + O(x) = x log x + O(x). 
This is a weak version of (8) which implies 


È din) ~ xlogx asx- œ 
nsx 
and gives log n as the average order of d(n). 

To prove the more precise formula (8) we return to t 
counts the number of lattice points in a hyperbolic region and take advantage 
of the symmetry of the region about the line q = d. The total number of 
lattice points in the region is equal to twice the number below the line q=d 


he sum (9) which 
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Vd. [5 ] - « lattice points on this segment 


Figure 3.2 


plus tne number on the bisecting line segment. Referring to Figure 3.2 we 


see that 
Ldn) =2 > IH - a} HRA. 
dsJx 


nsx 


Now we use the relation [y] = y + O(1) and parts (a) and (d) of Theorem 3.2 
to obtain 


Y49-2Y E -d4 ow} + O/%) 
ds/x 


nsx 


22x Y j-2Y 4 0/9 


dsJjx dsJx 
1 
i 2xf 0g Jett «(7 = 2E + a/a} + 0(/x) 
x 
2 x log x + QC — 1)x + 0/5). 
This completes the proof of Dirichlet’s formula. fal 


Note. The error term O(./x) can be improved. In 1903 Voronoi proved 
that the error is O(x!/? log x): in 1922 van der Corput improved this to 
Q(x33!100). The best estimate to cate is O(x 2137) +2) for every € > 0, obtained 
by Kolesnik [35] in 1969. The determination of the infimum of all 0 such that 
the error term is O(x) is an unsolved problem known as Dirichlet's divisor 
problem. In 1915 Hardy and Landau showed that inf 0 > 1/4. 
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3.6 The average order of the divisor 
functions c, (n) 


The case « = 0 was considered in Theorem 3.3. Next we consider real « > 0 
and treat the case a = 1 separately. 


Theorem 3.4 For all x 7 1 we have 
(11) ¥ om) == jue + O(x log x). 
nsx 


Note. It can be shown that ¢(2) = 22/6. Therefore (11) shows that the 
average order of c,(n) is 1?n/12. 


Pnoor. The method is similar to that used to derive the weak version of 
Theorem 3.3. We have 


Y oi YYXa- zaz DDE, 


nsx qin dsx qąsxjd 
ace 


-ZEO +o} 3 E+ ez) 


= 11 109. o 1 4 ote og9—4 coe 
=o — Zt i zal (x og x)= 5 (2x + O(x log x), 


where we have used parts (a) and (b) of Theorem 3.2. Lr 
Theorem 3.5 /f x > 1 anda > 0,a 4 1, we have 
: 
Lod = EED yen 4 og 
nsx 
where B = max(1, a}. 
Pnoor. This time we use parts (b) and (d) of Theorem 3.2 to obtain 


Xon-YXeé-X Y« 


nsx n&x qin dx qsx/d 


1 xVeti J xs xt 1 
Een! «Iis eZ 2) 
xt*1 xt T 

"Iul j 


1-a 
+ (o) + oco) 
Ey ‘ 


= SS x + Olx) + Ol) + OQ) = Het) uen LOGA 
where f = max{1, a}. 
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3.7: The average order of p(n) 
" To find the average order of o,(n) for negative « we write « = — B, where 
20. i 
Theorem 3.6 If f > 0 let ô = max{0, 1 — B). Then if x > 1 we have 


Doan) = (B+ 1)x +O’) if B #1, 


nsx 


= ((2)x + O(log x) ifß= 1. 


Proor. We have 
1 1 
Y o_,(n) = == E 
be em hee 


-Tayrow -x) 1 1 
de. dh |d * E gre mg 2 2) 
The last term is O(log x) if f = 1 and O(x?) if f # 1. Since 


] xA 
x = gu--g + (B+ 1)x + O(x^^) = UB + 1)x + O(x!~) 


this completes the proof. o 


3.7 The average order of p(7) 


The asymptotic formula for the partial sums of Euler’s totient involves the 
sum of the series 


r 
n, 
Ie. 


This series converges absolutely since it is dominated by Y, n7?. Ina later 
chapter we will prove that 


e (n 1 6 
12 ————- 
nd x n? wu) Um 
Assuming this result for the time being we have 

gla) _ wn) y ae) 

Lele hw 


by part (c) of Theorem 32. We now use this to obtain the average order of (n). 
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Theorem 3.7 For x > 1 we have 
3 
(13) È oln) =a x? + O(x log x), 


so the average order of q(n) is 3n/n?. 


Proor. The method is similar to that used for the divisor functions. We start 
with the relation 


on) = Y, ud) 
din 


and obtain 
= m. dus d 
Zo Epsei- pem- Fai) Da 


qdsx 


12) 6 et 
=5* 5 + o(z)} + O(x log x) = z* + O(xlogx) O 


3.8 An application to the distribution of 
lattice points visible from the origin 


The asymptotic formula for the partial sums of q(n). has an interesting 


application to a theorem concerning the distribution of lattice points in the 
plane which are visible from the origin. 


Definition Two lattice points P and 
line segment which joins them c 
endpoints P and Q. 


Q are said to be mutually visible if the 
ontains no lattice points other than the 


Theorem 3.8 Two lattice points (a, b) and (m, n) are mutually visible if, and 
only if, a — m and b — n are relatively prime. 


Pnoor. It is clear that (a, b) and (m, n) are mutually visible if a 
(a — m, b — n)is visible from the origin. Hence it suffices to provet 
when (m, n) — (0, 0). ` 

Assume (a, b) is visible from the origin, and let d = 
prove that d = 1. Ifd > 1 thena = da',b = db’ 
on the line segment joining (0, 0) to (a, 
d=1. 
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nd only if 
he theorem 


(a, b). We wish to 
and the lattice point (a', b’) is 
b). This contradiction proves that 


3.8: An application to the distribution of lattice points visible from the origin 


Conversely, assume (a, b) = 1. Ifa lattice point (a', b’) is on the line segment 
joining (0, 0) to (a, b) we have 
a’ = ta, b'—tb, where0 «t <1. 
Hence t is rational, so t = r/s where r, s are positive integers with (r, s) = 1. 
Thus 
sa = ar and sb' = br, 
so s|ar, s|br. But (s, r) = 1 so sla, s|b. Hence s — 1 since (a, b) — 1. This 


contradicts the inequality 0 < t < 1. Therefore the lattice point (a, b) is 
visible from the origin. 


There are infinitely many lattice points visible from the origin and it is 
natural to ask how they are distributed in the plane. 
Consider a large square region in the xy-plane defined by the inequalities 


Ix| <r, lyl <r. 


Let N(r) denote the number of lattice points in this square, and let N'(r) 
denote the number which are visible from the origin. The quotient N '(r)N(r) 
measures the fraction of those lattice points in the square which are visible 
from the origin. The next theorem shows that this fraction tends to a limit as 
r — oo. We call this limit the density of the lattice points visible from the 


origin. 
Theorem 3.9 The set of lattice points visible from the origin has density 6/n?. 
Pnoor. We shall prove that 


im N42. 
IDNO x- 


The eight lattice points nearest the origin are all visible from the origin. 
(See Figure 3.3.) By symmetry, we see that N'(r) is equal to 8, plus 8 times the 
number of visible points in the region 


{(x,y):2 <x <T, 1<y<x}, 


(the shaded region in Figure 3.3). This number is 
N()2848 X Y 128 Y of). 


2<nsr lsm<n isnsr 
(m,n)=1 


Using Theorem 3.7 we have 


24 
N= r? + O(r log r). 
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ji 


Figure 3.3 


But the total number of lattice points in the square is 


N(r) = Q[r] + 1? = Qr + O(1))? = 422.4. O(r) 


so 
24 , 6 log r 
Ne) zl + O(r log r) mt eee") 
Se LUND, SR. A 
N(r) 4r* + O(r) jen e() 
5 
Hence as r — co we find N'(ryN(r) > 6/z?. [m] 


Note. The result of Theorem 3.9 is sometimes described by saying that a 
lattice point chosen at random has probability 6/1? of being visible from the 


origin. Or, if two integers a and b are chosen at random, the probability that 
they are relatively prime is 6/12. 


3.9 The average order of u(n) and of A(n) 
The average orders of u(n) and A(n) are considerably more difficult to deter- 


mine than those of y(n) and the divisor functions. It is known that y(n) has 
average order 0 and that A(n) has average order |. That is, 


xo X 


wi 

lim — Y un = 0 
nsx 

and 


Ab 
lim g PU mi 


x-^o 


3.10: The partial sums of a Dirichlet product 


but the proofs are not simple. In the next chapter we will prove that both these 
results are equivalent to the prime number theorem, 


lim z(x)log x ei 


xo x 


where z(x) is the number of primes < x. 

In this chapter we obtain some elementary identities involving y(n) and 
A(n) which will be used later in studying the distribution of primes. These 
will be derived from a general formula relating the partial sums of arbitrary 
arithmetical functions f and g with those of their Dirichlet product f * g. 


3.10 The partial sums of a Dirichlet product 
Theorem 3.10 [fh = f* g, let 
H(x)= Ð hin), —Fo)- LS),  andG(x)- Y (m) 
nsx nsx 


nsx 


Then we have 
(14) H(9)- X sool) => sor?) 
nsx nsx n 


Pnoor. We make use of the associative law (Theorem 2.21) which relates the 
operations e and *. Let 


ua) = f roce 
bx) =) rail iss 1: 


Then F = f » U, G = g » U, and we have 


feG=feogeU)=(f*g)°U =H, 
g°F=g°(f - U) = (g » f)» U = H. 


This completes the proof. n 


If g(n) = 1 for all n then G(x) = [x], and (14) gives us the following 
corollary: 


Theorem 3.11 f F(x) = $ „<x f(n) we have 


(15) zyra- zrol] zf) 


nsx d|n nsx 
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3.11 Applications to u(n) and A(n) 
Now we take f(n) = p(n) and A(n) in Theorem 3.11 to obtain the following 
identities which will be used later in studying the distribution of primes. 


Theorem 3.12 For x > 1 we have 


x 
(16) X OH =1 


nsx 
and 


(17) » OH = log [x]!. 


nsx 


Proor. From (15) we have 


zoil- 2200 z[]-: 


and 


2: OH = Y A)- F logn = log[x]!. D 
nsx nsx 


nsx d|n 


Note. The sums in Theorem 3.12 can be regarded as weighted averages 
of the functions ju(n) and A(n). 


In Theorem 4.16 we will prove that the prime number theorem follows 
from the statement that the series 


we mn) 
ATP 


converges and has sum 0, Using (16) we can prove that this series has bounded 
partial sums. 


Theorem 3.13 For all x > 1 we have 


(18) y ^ 


nsx n 
with equality holding only ifx <2, 
Proor. If x < 2 there is only one 


term in th: = 
that x > 2. For each real y let (y) = y — Di. ee 1. Now assume 


1= esl ]- zo- H-z y uo 
nsx nex n 


M 


3.11: Applications to a(n) and A(n) 
Since 0 < {y} < 1 this implies 


zei Zoot e Zo) 


nsx 


=1 E Y Plito +- 
2snsx(” 
Dividing by x we obtain (18) with strict inequality. o 


We turn next to identity (17) of Theorem 3.12, 


(17) p CH = log(x]!, 


nsx 


and use it to determine the power of a prime which divides a factorial. 


Theorem 3.14 Legendre's identity. For every x > 1 we have 


(19) fx]! = [Tp 


psx 
where the product is extended over all primes <x, and 


(20) a(p) = X [z] 


m-1 
Note. The sum for o(p) is finite since [x/p"] = 0 for p > x. 


Pnoor. Since A(n) = 0 unless n is a prime power, and A(p") — log p, we have 


GJ 


log[x]! = POH D B = Z posp, 


psxm=1 


where a(p) is given by (20). The last sum is also the logarithm of the product 
in (19), so this completes the proof. n 


Next we use Eulers summation formula to determine an asymptotic 
formula for log[x] '- 


Theorem 3.15 If x > 2 we have 


(21) log[x]! = xlog x — x + O(log x), 

and hence 

Q2) DAM) B = x log x — x + O(log x). 
nsx 
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Pnoor. Taking f(t) = log t in Euler's summation formula (Theorem 3.1) 
we obtain 


Zlogn= f'logcar + [ HH dt — (x — [x])log x 
nsx 
f= DEI. US 
zxlogx-—x-rl- f en. + O(log x). 
1 
This proves (21) since 
PE x1 
I P ue «jf rat) = O(log x), 
1 t it 
and (22) follows from (17). o 


The next theorem is a consequence of (22). 


Theorem 3.16 For x > 2 we have 


(23) » [z] log p = x log x + O(x), 


psx 


where the sum is extended over all primes <x. 


Proor. Since A(n) = 0 unless is a prime power we have 


PHIOS z È [aun 


"mas 


Now p" < x implies p < x. Also, [x/p"] = 0 if p > x so we can write the 
last sum as 


pP E[bee-r IHE X XE s Jon 


pSxm-2 


Next we prove that the last sum is O(x). We have 


Zier Š |=] < lop pee v (1\" 
psx S E 2 p" TE xD log 2» p 


e| 1 1 
=x} log p.—- x logp 
LE P nx psx P(p — 1) 
P 


z5 logn 
n=2 n(n — 1) 


= O(x). 
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Hence we have shown that 


gEpo- z [jer + o 


which, when used with (22), proves (23). [m] 


Equation (23) will be used in the next chapter to derive an asymptotic 
formula for the partial sums of the divergent series Y tp). 


3.12 Another identity for the partial sums of 

a Dirichlet product 
We conclude this chapter with a more general version of Theorem 3.10 that 
will be used in Chapter 4 to study the partial sums of certain Dirichlet 
products. 

As in Theorem 3.10 we write 
Fx)= Df),  Go)- Xm, and Hx) = X * gn) 
nsx nsx nsx 


so that 
Hw) = Y x rax) = Lfd. 


qdsx 


Theorem 3.17 If a and b are positive real numbers such that ab = x, then 


es ¥ sede) = Zraci) + Zanr) - reco) 
q.d nsa ns 
qdsx 


at 


Figure 3.4 
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Proor. The sum H(x) on the left of (24) is extended over the lattice points in 
the hyperbolic region shown in Figure 3.4. We split the sum into two parts, 
one over the lattice points in A U B and the other over those in B u C. The 
lattice points in B are covered twice, so we have 


H()- X f(ea- f(de(a- E È f (oa). 


dza q<x/d azb d<x/q dSaqsb 


which is the same as (24). Tz] 


Note. Taking a — 1 and b — 1, respectively, we obtain the two equations 
in Theorem 3.10, since f(1) = F(1) and g(1) = G(1). 
Exercises for Chapter 3 


1. Use Euler's summation formula to deduce the following for x > 2: 


a Y, 


nsx 


E D. 


log x 
2 bog XctA- o( 82 E ) where A is a constant. 


(b) 


asse n log n 


1 
= log(log x) + B+ of ) where B is a constant. 
x log x. 


2 If x > 2 prove that 


din) 1 
y—- 3 log? x + 2C log x + O(1), where C is Euler's constant. 


nsx 


3. Ifx>2anda>0,a4 1, prove that 


d AE 
2 OO TBE say + oan 


nsx M 


4. If x 2 2 prove that: 


(a) Yu: | -2 ij * 9 log 9. 


nsx 


pn) | x 
Di BE: qd lees 
5. If x > 1 prove that: 


6) Dom =4 POF IE x 
nsx 2 s 23 
nsx nsx n |n 
These formulas, together with those in Exercise 4, show that, for XI 
9m) x 
Lsn- ; O ate O(x log x) and Dx n “IQ + O(log x). 
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10. 


n. 


. In a later chapter we will prove that Yes een 


- In a later chapter we 


Exercises for Chapter 3 


If x > 2 prove that 


where C is Euler’s constant and 
æ p(njlogn 


A= z 


-a = 1/4) ifa > 1. Assuming this, 


prove that for x > 2 and x > 1, a # 2, we have 
y oy EA o + O(x!^* log x). 
nm 2-a«Q 9 


nsx 


. Ifa < 1 and x > 2 prove that 


e x jila 1-2 Jog x). 
ES Eno Mir 


nsx 


i infini duct [], (1 — p^?) extended 
will prove that the infinite pro ? I eee 
over all primes, converges to the value 1/f(2) = 6/n2. Assuming this resu't, P 


that 


"LOBO TES: 


7" ^ 6n 


LHint: Use the formula q(n) = ” TI» ( ; 
1 Q1** withx= ‘] 
p. 


e 


Lick xx d EE E 


1-93) and the relation 


(b) If x > 2 prove that 


n 
—— = 0(x): 
L (n) 
If x > 2 prove that 
1 
+ = O(log x) 
Lon 


Let o,(n) = n Fan ln(d)/d- 


(a) Prove that ; is multipli 
(b) Prove that 


= 
cative and that 9,(9) =" TI» +P} 


n 
ou) = X wa) 


di|n 
for which d" |n. 


" ivi ofn 
where the sum is over those divisors 


(c) Prove that 
Leil) = X KASI E where S(x) = $, alk), 
it ) PET ( ) : ) ksx 


nsx 
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then use Theorem 3.4 to deduce that, for x > 2, 


%2) 
pL 1(n) = XA) 


As in Exercise 7, you may assume the result yan H(n)n~* = 1/€(a) fora > 1. | 


x? + O(x log x). 


12. For real s > O and integer k > 1 find an asymptotic formula for the partial sums 


X 1 
nsx M. 
(n,K)= 1 


with an error term that tends to 0 as x + co. Be sure to include the case s = 1. 


PROPERTIES OF THE GREATEST-INTEGER FUNCTION 


For each real x the symbol [x] denotes the Breatest integer « x. Exercises 13 
through 26 describe some properties of the greatest-integer function. In these 
exercises x and y denote real numbers, n denotes an integer. 


13. Prove each of the following statements: 
(a) If x = k + y where k is an integer and 0 « Yy < 1, then k = [x]. 
(b) [x + n] = [x] + n. 
ELLE: if x = [x], 
ep « -[]-1 ifx# [x]. 
(d) Cx/n] = [[x]/n]. ifn > 1. 
14. If0 < y < 1, what are the possible values of [x] - [x - y]? 


I5. The number {x} = x — x] is called the fractional part of x. It satisfies the in- 


equalities 0 < (x) < 1, with: {x} = 0 if, and only if, x is an integer. What are the 
Possible values of {x} + {—x}? 


16. (a) Prove that [2x] — 2[x] is either 0 or 1. 
(b) Prove that [2x] + [2y] > [x] + [Dr] + [x + y]. 
17. Prove that [x] + [x + 4] = [2x] and, more generally, 


* È + d = [nx]. 


k=0 
18. Let f(x) = x — [x] — 1. Prove that 


n-1* 


» s(x + B = f(nx) 
k=0 n 
and deduce that 


= 1 
PC * 3)| S1 rorallm > 1 and all real x: 


19. Given positive odd integers h and k, (h, k) = 1, leta = (k — 12,5 — MAN 
(a) Prove that 57. , [hr/k] + 37-1 [kr/h] = ab. Hint. Lattice points 
(b) Obtain a corresponding result if (h, k) =. i 
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Exercises for Chapter 3 


20. If n is a positive integer prove that [n + /n + 1] = [V4n + 2]. 
21. Determine all positive integers n such that [Vn] divides n. 


22. If n is a positive integer, prove that 


n—17 
E »-2- [Ez 


25 
is independent of n. 


23. Prove that 


X CH = [V7]. 


nsx n 


24. Prove that 
25. Prove that 


and that 


PHa 


26. Ifa = 1,2,.., 7 prove that there exists an integer b (depending on a) such that 


iL-Pz] 
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Some Elementary Theorems on the 
Distribution of Prime Numbers 


4.1 Introduction 


Ifx > Olet x(x) denote the number of primes not exceeding x. Then n(x) — oo 
as x — co since there are infinitely many primes. The behavior of n(x) as a 
function of x has been the object of intense study by many celebrated mathe- 
maticians ever since the eighteenth century. Inspection of tables of primes 


led Gauss (1792) and Legendre (1798) to conjecture that z(x) is asymptotic to 
x/log x, that is, 


z(x)log x E 


lim =1, 


x70 


This conjecture was first proved in 1896 by Hadamard [28] and de la Vallée 
Poussin [71] and is known now as the prime number theorem. 

Proofs of the prime number theorem are 
elementary, depending on the methods used t 
Hadamard and de la Vallée Roussin is an 
theory and properties of the Riemann zeta 
was discovered in 1949 by A. Selberg and 
use of the zeta function nor of complex function theory but is quite intricate. 
At the end of this chapter we give a brief outline of the main features of the 


elementary proof. In Chapter 13 we present a short analytic proof which is 
more transparent than the elementary proof. 

This chapter is concerned primarily with elementary theorems on primes. 
In particular, we show that the prime number theorem can be expressed in 
several equivalent forms. 


often classified as analytic or 
o carry them out. The proof of 
alytic, using complex function 
function. An elementary proof 
P. Erdós. Their proof makes no 
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4.2: Chebyshev's functions (x) and 3(x) 


For example, we will show that the prime number theorem is equivalent 
to the asymptotic formula 
(1) Y Atn)~ x asx 00. 
The partial sums of the Mangoldt function A(n) define a function introduced 
by Chebyshev in 1848. 


4.2. Chebyshev’s functions y/(x) and B(x) 


Definition For x > 0 we define Chebyshev’s y-function by the formula 


vx) = X AQ). 


Thus, the asymptotic formula in (1) states that 
w(x) 
2 lim — = 1. 
(2) hue TT 
Since A(n) = 0 unless n is a prime power we can write the definition of 
W(x) as follows: 


co 


y= TAM= $ Ag) X ler 


=1 psxi/m 
p™sx 


The sum on mis actually a finite sum. In fact, the sum on pis empty i x'/" < 2, 


that is, if (1/m)log x « log 2, or if 


Therefore we have 


wx)-2 X X logp. 


m< log2x px x!/m 


This can be written in a slightly different form by introducing another 
function of Chebyshev. 


Definition If x > 0 we define Chebyshev's 9-function by the equation 
Hx) = ¥ log p, 
psx 


where p runs over all primes <x. 
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> 
4: Some elementary theorems on the distribution of prime numbers 


The last formula for i/(x) can now be restated as follows: 


(3) Wx) = 9m. 


ms logax 
The next theorem relates the two quotients /(x)/x and 9(x)/x. 
Theorem 4.1 For x > 0 we have 
o < Ved) _ 909 , (log x)* — 
x x 2 / x log 2 


Note. This inequality implies that 


im (#2 - 28). 0 


x x 


x= o 


In other words, if one of Y(x)/x or 9{x)/x tends to a limit then so does the other 
and the two limits are equal. 


ProorF. From (3) we find 
0< W(x) - Hx)= Y Ix). 


2<mS logzx 
But from the definition of 9(x) we have the trivial inequality 


> 


Ax) < Y log x < x log x 


psx 
so 
0x46)—96)s Y  x!"log(x!") < (log, x) /x log J/x 
2m logox 
* logx Jx aa /x(log x)? 
Iam. DIE EE) 
"Now divide by x to obtain the theorem. a 


4.3 Relations connecting 9(x) and x(x) 


In this section we obtain two formu 
used to show that the prime num 
relation 


las relating 9(x) and n(x). These will be 
ber theorem is equivalent to the limit 


lim $e) zl. 
xo X 
Both functions x(x) and 9(x) are ste 


P functions with jumps at the primes; 
n(x) has a jump 1 at each prime p, f ee 


whereas 9(x) has a jump of log pat p. Sums 
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4.3: Relations connecting 3(x) and x(x) 


involving step functions o1 this type can be expressed as integrals by means of 
the following theorem. " 


Theorem 4.2 Abel's identity. For any arithmetical function a(n) let 
A(x) = Y, an) 


nsx 
where A(x) = Oifx < 1. Assume f has a continuous derivative on the interval 


Ly, x], where 0 < y < x. Then we have 


à — X aft) = ADS -ASO ~ [ AOsoae. 


y<ns 


Proor. Let k = [x] and m = [y], so that A(x) = A(k) and A(y) = A(m). 
Then 


k k 
Y amfm= X aes (n) = "E. fale — A(n — 2) f(n) 


y<nsx n=m 
k k-1 
= X Amf- Z AmS + 1) 
= Y A()Cf(n) — fin + D) + ASK) — A(m)f(m + 1) 
n=m+1 
k-1 ntl 
= = y aw f” SO di A009 — Alm sen + D 


SE ("arto dt A00 — Alm fim + 0 


n-m*i on 


- fF nro dt + ASO) — Í POL 


mi 
-4pfo»- [Asa 
= Ase) - 400/00 — [ “Aso di. D 


ALTERNATE PROOF. A shorter proof of (4) is available to those readers 
familiar with Riemann-Stieltjes integration. (See [2], Chapter 7) Since A(x) 
is a step function with jump f(n) at each integer n the sum in (4) can be 
expressed as a Riemann-Stieltjes integral, 


Y anre) = | Oaa 


y<nsx 


TI 


4: Some elementary theorems on the distribution of prime numbers 
Integration by parts gives us 


y<nsx 


E aft) = fG9469 — f()4Q) — | A() df (t) 
= fA) — FOA) — [40 s(t) dt. D 


Note. Since A(t) — O ift « 1, when y « 1 Equation (4) takes the form 


6) Late) sto) = asses) - SAOSA de 


nsx 


It should also be noted that Euler’s summation formula can easily be 
deduced from (4). In fact, if a(n) = 1 for alln > 1 we find A(x) = [x] and (4) 
implies 


Y fo) = food - OD- fioa. 
y 


y*nsx 


Combining this with the integration by parts formula 


[iro = 7 - xo) [roa 
y y 
we immediately obtain Euler’s summation formula (Theorem 3.1). 
Now we use (4) to express 9(x) and n(x) in terms of integrals. 


Theorem 4.3 For x > 2 we have 


(6) I(x) = n(x)log x — [: a dt 
2 
and 
3(x) * 91) 
7. Se 
(7) nts) = oy ji Tort 


PRoor. Let a(n) denote the characteristic function of the primes; that is, 


l ifnis prime 
a(n) = d 
tn) ls otherwise. 
Then we have 


= l= = 
n(x) m 20 and 9(x) = os p= 


M a(n)log n. 


l«nxx 
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4.4: Some equivalent forms of the prime number theorem 


Taking f(x) = log x in (4) with y = 1 we obtain 
(x) = 2 aimlog n = n(x)log x — z(1)log 1 — Í O gr 
i<n 1 


which proves (6) since n(t) = 0 for t < 2. 
Next, let b(n) = a(n) log n and write 


x)= > bin) iz , o 9(9)- Yb(n) 
3/2<nsx nsx 
Taking f(x) = 1/log x in (4) with y = 4 we obtain 
Gx) 932) n = (t) 
logx log 3/2 an t log’ t 


n(x) = 


which proves (7) since 9(t) = O if t < 2. o 


4.4 Some equivalent forms of the prime 
number theorem 
Theorem 4.4 The following relations are logically equivalent: 


(8) im NT 


9) TEGET 


x-o 
(10) dia teed 
x-o 
Pnoor. From (6) and (7) we obtain, respectively, 
9(x) mG)legx _ D js a(t) dt 
a Fal als 


2 x 


and 
m(x)ogx _ 9(x) x logx i M(t) dt 
3 me eens Jz AODA 1 


es (9) we need only show that (8) implies 


e 1 is n(t) Lor 


xo * 


To show that (8) impli 


1 
But (8) implies “© = = Coe: a) for t > 20 


ifs WO) 4 = «i pi) 
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Now 
* df v= dt "dox  x- Vx 
2 logt js log t [ies log J/x 
so 
> E as x — oo. 


This shows that (8) implies (9). ANS 
To show that (9) implies (8) we need only show that (9) implies 
a loge T S(t) dt es 
Rm 2 tlog? t 


xix X 


But (9) implies 9(t) — O(t) so 


log x (* 9(t) dt = of 28% = d ) 


x J, tlog?t x J, log? t 
Now 
* dt ME di = d xx R= fe 
i-i tes 
2logt J, logřt J zlogi log log? /x 
hence 


logx (* dt 
X ds loge 


20 asx o. 


This proves that (9) implies (8), so (8) and (9) are equivalent. We os 
already, from Theorem 4.1, that (9) and (10) are equivalent. 


The next theorem ri 


elates the prime number theorem to the asymptotic 
value of the nth prime, 


Theorem 4.5 Let Pa denote the nth prime. Then the following asymptotic 
relations are logically equivalent: 


(1) lim Zog x _ | 


x0 x 


(12) lim zog t) E 


x00 


: Pn 
I - 
(a) D. nlogn ! 
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4.4: Some equivalent forms of the prime number theorem 


Proor. We show that (11) implies (12), (12) implies (13), (13) implies (12), and 
(12) implies (11). 
Assume (11) holds. Taking logarithms we obtain 


lim [log n(x) + log log x — log x] = 0 


xo 


lim [ios (o x60. darle _ JI - 0. 


or 
SEU log x log x 
Since log x — oo as x > co it follows that 


lim log n(x) ip log log x 1)=0 
log x log x 


xo 


from which we obtain 


gs MESS. 
xo log x 


This, together with (11), gives (12). 
Now assume (12) holds. If x = p, then (x) = n and 


n(x)log n(x) = n log n 


so (12) implies 


l 
nlgn , 


lim 
na Pn 
Thus, (12) implies (13). 
Next, assume (13) holds. Given x, define n by the inequalities 
Pn S X € Posi 


so that n = n(x). Dividing by n log n, we get 
Pn x Pat. _ Pn+1 (n + Ilog(n + 1) 
ME: ae = " 
nlogn nloggn nlogn (n + 1)log(n + 1) nlogn 


Now let n — oo and use (13) to get 


LN or lim S 
posi logn ' zoe Mxlog n(x) ' 


Therefore, (13) implies (12). 


Finally, we show that (12) implies (11). Taking logarithms in (12) we 


obtain 
lim (log n(x) + log log x(x) — log x) = 0 
x> 
or 
, log log a(x) _ log x E 
dim [108 sex 25 log n(x) log n(x) 
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Since log z(x) — oo it follows that 


5 log log z(x) logx 
im ( " log n(x) log 5) à 


or 


i log x x 
z-a loga) 
This, together with (12), gives (11). El 


4.5 Inequalities for n(n) and p, 


The prime number theorem states that n(n) ~ n/log n as n — co. The in- 
equalities in the next theorem show that n/log n is the correct order of 
magnitude of x(n). Although better inequalities can be obtained with greater 


effort (see [60]) the following theorem is of interest because of the elementary 
nature of its proof. 


Theorem 4.6 For every integer n > 2 we have 
lon n 
14 > j — 
aa blogs Oat n 
Pnoor. We begin with the inequalities 
as) r< ba <4, 
n 


2n (Qn)! , i ; i . A r 
where Ese ia binomial coefficient. The rightmost inequality. 


follows from the relation 


æ (2n 2n 
4 = (14 1)"= 
‘ à ( k ) » ( n ) 
rA the other one is easily verified by induction. Taking logarithms in (15) we 
(16) nlog2 < log(2n)! — 2 logn! < n log 4. 
But Theorem 3.14 implies that 


logn! = ¥ a(p)log p 


Psn 
where the sum is extended over primes and o(p) is given by 


a(p) = b» [= 
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4.5: Inequalities for z(n) and p, 


Hence 
[el fad Ta 
(17) log2n! -2loggn!- Y. X {2 — 12. hos p. 
pS2n m-1 p pr 
Since [2x] — 2[x] is either 0 or 1 the leftmost inequality in (16) implies 
[557] 
nlog2< Y Y Ilogp < Y log2n = n(2n)log 2n. 
ps2n | m=1 ps2n 


This gives us 


nlogg2 2n log2 ! 2n 
logg2n log2n 2 4 log 2n 


(18) an) > 


since log 2 > 1/2. For odd integers we have 

1 2n sd 2n 2n 1 .1 2n 1 
4log2n ^ 42n + 1 log(2n + 1) ~ 6 log(2n + 1) 
since 2n/(2n + 1) > 2/3. This, together with (18), gives us 

1 n 
6 logn 


n(2n + 1) 2 n(2n) > 


n(n) > 


for all n > 2, which proves the leftmost inequality in (14). 
To prove the other inequality we return to (17) and extract the term 
corresponding to m = 1. The remaining terms are nonnegative so we have 


log2n)! —2logn!= > e] = af” hoe p. 
ps2n (LP P 


For those primes p in the interval n < p < 2n we have [2n/p] — 2[n/p] = 1 
so 
log(2n)! — 2 log n! > » log p = 9(2n) — 9(n). 
n<pS2n 

Hence (16) implies 
9(2n) — Mn) < n log 4. 

In particular, if n is a power of 2, this gives 

gQr*!) — 9(27) < 2" log 4 = 27*! log 2. 

Summing on r = 0, 1, 2, ..., k, the sum on the left telescopes and we find 
9(2k* 1) < 2**? log 2. 
Now we choose k so that 2* < n < 2**! and we obtain 

Bin) < 9(2** !) < 2**? log 2 < 4n log 2. 
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But if 0 < a < 1 we have 


(n(n) — n(n*)ogn* < Y! log p < Xn) < 4n log 2, 


n*«psn 


hence 


4 2 2- 
n log E 4n log 
alogn 


n(n) < 


a logn 


n ( log2 log ) 
=— tA) 
logn a n 


Now if c > 0 and x > 1 the function f(x) = x^* log x attains its maximum 
at x = e'^, so n^*logn < l/(ce) for n> 1. Taking « = 2/3 in the last 
inequality for (n) we find 


n 3 n 
no) eie (1082+ 2) <6 


This completes the proof. E 


Theorem 4.6 can be used to obtain upper and lower bounds on the size 
of the nth prime. 


Theorem 4.7 For n > 1 the nth prime p, satisfies the inequalities 
1 12 
(19) c" logn < p, « 12 nlogn + nlog—}. 
Proor. If k = p, then k > 2and n = n(k). From (14) we have 


k p, 
n= n(k) <6 = n 
(9 log k MUT 


hence 


1 1 
Po > Gn log p,» En log n. 


This gives the lower bound in (19). 
To obtain the upper bound we again use (14) to write 


from which we find 
(20) Pn < 6n log pp. 
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4.6: Shapiro’s Tauberian theorem 


Since log x < (2/e),/x if x > 1 we have log p, < (2/e)./ Pn , so (20) implies 
Pn < L, n. 
e 
Therefore 


un « log n 4- lo E 
3 98 Pr g e 


which, when used in (20), gives us 
Dis (2 logn + 2 log 2) 
e 
This proves the upper bound in (19). [m] 
Note. The upper bound in (19) shows once more that the series 
SS 
nz1 Pn 


diverges, by comparison with $7; 1/(n log n). 


4.6 Shapiro's Tauberian theorem 
We have shown that the prime number theorem is equivalent to the 


asymptotic formula 


(21) 1 Y A(n-1 asx- oo. 


X nsx 
In Theorem 3.15 we derived a related asymptotic formula, 
(22) » CH = x log x — x + O(log x). 


Both sums in (21) and (22) are weighted averages of the function A(n). 
Each term A(n) is multiplied by a weight factor 1/x in (21) and by [x/n] in (22). 
Theorems relating different weighted averages of the same function are 
called Tauberian theorems. We discuss next a Tauberian theorem proved in 
1950 by H. N. Shapiro [64]. It relates sums of the form Y, . a(n) with those of 


the form Xn <x a(n) [x/n] for nonnegative a(n). 
Theorem 4.8 Let {a(n)} be a nonnegative sequence such that 
x 
2 |= xlogx + O(x) for all x 2 1. 
Q3) zeli] xlogx + OQ) f 
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Then: 
(a) For x z 1 we have 


Y zo) — log x 4- O(1). 


nsx 


(In other words, dropping the square brackets in (23) leads to a correct 
result.) 
(b) There is a constant B > 0 such that 


$ a(n) < Bx for all x > 1. 


nsx 


(c) There is a constant A > 0 and an xo > 0 such that 


¥ a(n) = Ax for all x > xo. 


nsx 


Proor. Let 


Se) = Yan) T(x) = rai] 


nsx nsx 


First we prove (b). To do this we establish the inequality 


(24) S(x) - s) < T(x) - 21(5) 


We write 


norms) zo x [zo 
AGI n po 


Since [2y] — 2[ y] is either 0 or 1, the first sum is nonnegative, so 


T(x) — 2T z) B? A 2 x 
2 E = gh. n m ONE E 869 78 2 i 


This proves (24). But (23) implies 
x 


T) = 21(5) = x log x + O(x) — (5 log > d o) = 0(x). 


Hence (24) implies S(x) — S(x/2) = O(x). This means that there is some 
constant K > 0 such that | 


x 
S(x) — s) S Kx forallx 2 1. 


86 


4.6: Shapiro’s Tauberian theorem 


Replace x successively by x/2, x/4, .. . to get 


G- 


etc. Note that S(x/2") = 0 when 2" > x. Adding these inequalities we get 


I dp 
Ste) « (1 theft) =2Ks 


This proves (b) with B = 2K. 
Next we prove (a). We write [x/n] = (x/n) + O(1) and obtain 


TX) = X HZ =>) (: + 04 Jat =x Ds ao) o( a) 
nsx nsx M nsx ( nsx 


=x 5 a) , ove, 


nsx 
by part (b). Hence 


Y eo) - : T(x) + O(1) = log x + O(1). 


nsx 
This proves (a). 
Finally, we prove (c). Let 


A(x) = X aln). 
nsx 
Then (a) can be written as follows: 
A(x) = log x + R(x), 


where R(x) is the error term. Since R(x) = O(1) we have |R(x)| € M for 


some M > 0. 
Choose «to satisfyO < æ < 1 (weshall specify « more exactly in a moment) 


and consider the difference 


A()- Aex)- Y dn) ^ e y 


nsx n nsx n nsax n 
If x > 1 and ax > 1 we can apply the asymptotic formula for A(x) to write 


A(x) — Alax) = log x + R(x) — (log ax + R(ox)) 
= —log « + R(x) — R(ax) 
> —log « — |R(x)| — |R(@x)| > —log « — 2M. 


87 


4: Some elementary theorems on the distribution of prime numbers 


Now choose g so that —log x — 2M = 1. This requires log a = —2M — 1, 
ora = e-?M4-!, Note that 0 < « < 1. For this o, we have the inequality 


A(x) — A(ax) > 1 ifx > l/a. 
But 


S(x) 


Ay)- A= y Lx n= 


1 
ax<nsx ax 


È ain 
nsx 
Hence 


309 .. 1 if x > 1/a. 
ax 


Therefore S(x) > ax if x > 1/a, which proves (c) with A = a and xo = 1/a. 


4.7 Applications of Shapiro's theorem 


Equation (22) implies 
D vo] ] = x log x + O(x). 
nsx 
Since A(n) > 0 we can apply Shapiro’s theorem with a(n) = A(n) to obtain: 


Theorem 4.9 For all x > 1 we have 


(25) y w. log x + O(1). 


nsx 
Also, there exist positive constants c, and c, such that 


VG) xex forallx 21 
and 


y(x) > cox for all sufficiently large x. 


Another application can be deduced from the asymptotic formula 


> [= foe p = x log x + O(x) 


psx 


proved in Theorem 3.16. This can be written in the form 


Q9 X ^o] ]- plus boo 


nsx 
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4.8: An asymptotic formula for the partial sums „<x (1/p) 


where A, is the function defined as follows: 


_ flog p ifnisa prime p, 
^im lo otherwise. 


Since A,(n) > 0, Equation (26) shows that the hypothesis of Shapiro's 
theorem is satisfied with a(n) = A,(n) Since 9(x) = ),<, A,(n), part (a) of 
Shapiro’s theorem gives us the following asymptotic formula. 


Theorem 4.10 For all x > 1 we have 


(27) as ES = log x + O(1). 


psx 


Also, there exist positive constants c, and c; such that 
9(x)zex forallx 21 
and 


Xx) > cx for all sufficiently large x. 


In Theorem 3.11 we proved that 


20 x 
Z]- EG) 
for any arithmetical function f(n) with partial sums F(x) = Ys f(n). 
Since W(x) = Yy<x A(n) and 9(x) = Èn <x Ai(n) the asymptotic formulas in 


(22) and (26) can be expressed directly in terms of (x) and 9(x). We state 
these as a formal theorem. 


Theorem 4.11 For all x 2 1 we have 


Q8) » (2) = x log x — x + O(log x) 
nsx 

and 

» «3 = x log x + O(x). 


nsx 


4.8 An asymptotic formula for the partial 
sums )’p<x (1/p) 


In Chapter 1 we proved that the series Y. (1/p) diverges. Now we obtain an 
asymptotic formula for its partial sums. The result is an application of 
Theorem 4.10, Equation (27). 
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Theorem 4.12 There is a constant A such that 


f 1 
(29) 2i- loglogx + 4 + o( i a for all x > 2. 
pzxP 
Proor. Let 
lo 
a= Y 2? 
psx P 
and let 


1 ifnis prime, 
We {i otherwise. 


Then 


and A(x) = Y 49) tog y. 


psxP nsx nsx 


Therefore if we take f(t) = 1/log t in Theorem 4.2 we find, since A(t) = 0 for 
tez 


(30) es 


Kp logx Jz tlogt 


From (27) we have A(x) = log x + R(x), where R(x) = O(1). Using this on 
the right of (30) we find 


1 log x + O(1) m n log t 4- R(t) 


psxP log x 2 tlog?t 
(31) x x 
M EA E 
log x 2 tlogt t log’ t 

Now 

nab 

b Blog, ER log log 2 
and 


* R(t) lle R(t) = R(t) 
dt = = 
ip tlog? t 2 tlog? ;* [ rag coh 


the existence of the improper integral being assured by the condition R(t) = 


O(1). But 
^ R(t) IE 1 
dt — O| 
li t log? t (f t log? z) (sex) 


4.9: The partial sums of the Möbius function 


Hence Equation (31) can be written as follows: 


1 o R(t) 1 
— = log! 1 — log log 2 ar —,]- 
li og log x + og log 2 + jl Toei dt + es j 


This proves the theorem with 
^ RO 
A = 1 — loglog2 — 
adir 
4.9 The partial sums of the Móbius function 
Definition If x 2 1 we define 


M(x) = Y. p(n). 


The exact order of magnitude of M(x) is not known. Numerical evidence 
suggests that 


IM(x)| <x ifx>1, 


but this inequality, known as Mertens’ conjecture, has not been proved nor 
disproved. The best O-result obtained to date is 


M(x) = O(x6(x)) 


where ó(x) = exp{—A log? x(log log x) 1/5) for some positive constant A. 
(A proof is given in Walfisz [75].) 
In this section we prove that the weaker statement 


jn Olo 


is equivalent to the prime number theorem. First we relate M(x) to another 
weighted average of p(n). 
Definition If x > 1 we define 

H(x) = }, u(n)log n. 


nsx 


The next theorem shows that the behavior of M(x)/x is determined by 
that of H(x)/(x log x). 


Theorem 4.13 We have 


(32) lim (m uH )- 0 


xo, X xlogx 
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Proor. Taking f(t) = log t in Theorem 4.2 we obtain 
* M(t 
H(x) = Y; u(n)log n = M(x)log x — f o dt. 
nsx 1 


Hence if x > 1 we have 


M(x) | Hx) _ 1 fea 
x xlgx xlogxJ, t.s 


Therefore to prove the theorem we must show that 


E "Mi, ie 
(33) lim x log x Í r pio 


But we have the trivial estimate M(x) = O(x) so 


[^9 dt — of f'ar) = O(x), 


from which we obtain (33), and hence (32). DH 
Theorem 4.14 The prime number theorem implies 


lim Mid =0. 


xo X 


Proor. We use the prime number theorem in the form Wx) ~ x and prove 


that H(x)/(x log x) + 0 as x > oo. For this purpose we shall require the 
identity 


(34) —H(x) = — Y u(n)ogn- Y, ww(s) 
nsx 


nsx 


To prove (34) we begin with Theorem 2.1 1, which states that 
Aln) = — Y. u(d)log d 
din 


and apply Móbius inversion to get 


—u(nlog n = y uw). 
d|n d 


Summing over all n < x and using Theorem 3.10 with f = u, g = A, we 
obtain (34). 


Since W(x) ~ x, if c > 0 is given there is a constant A > Osuch that 
yo — 
x 


1]<e whenever x > A. 
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In other words, we have 
(35) |W(x) — x| «€ ex whenever x > A. 


Choose x > A and split the sum on the right of (34) into two parts, 
QUE PIS 
nsy ysnsx 


where y = [x/A]. In the first sum we have n < y so n < x/A, and hence 
x/n > A. Therefore we can use (35) to write 


Thus, 
x S XN. x 
Jr.) - xp (t e«)-1) 
xY u(n) » re " 5) 
nsy nsy n 
so 
Zeon) p x pO) - 


«xc EY Lux ex(l + log y) 
asy 


< x + EX + ex log x. 
In the second sum we have y < n < xson> y + 1. Hence 


x 


ype 4 


x 
IX 
n 
because 
ZOE FL 
ys A y . 
The inequality (x/n) < A implies V(x/n) < (A). Therefore the second sum is 
dominated by x/(A). Hence the full sum in (34) is dominated by 
(1 + &)x + ex log x + x(A) < (2 + W(A))x + ex log x 
ife < 1. In other words, given any e such that 0 < £ < 1 we have 
| H(x)| < (2 + V(A)x + exlogx ifx > A, 
or 


|H(x)| | 2 + WA) 
AA eei AME 
xlogx log x 
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Now choose B > A so that x > B implies (2 + W(A))/log x < e. Then for 
x > B we have 
IH) _ 5 
x log x 


> 


which shows that H(x)/(x log x) ^ 0 as x > oo. 


We turn next to the converse of Theorem 4.14 and prove that the relation 


MC) 


(36) lim =0 


xo 


implies the prime number theorem. First we introduce the “little oh” 
notation. 


Definition The notation 
f(x)-o(g(x) asx- oo (read: f(x) is little oh of g(x)) 
means that 


f(x) 
2m ate) g(x) Fe 
An equation of the form 
f(x) = h(x) + o(g(x)) as x ^ oo 


means that f(x) — h(x) — o(g(x)) as x — oo. 
Thus, (36) states that 


M(x) = o(x) as x 2 oo, 


and the prime number theorem, expressed in the form y/(x) ~ x, can also be 
written as 


W(x) = x +o(x) as x> oo. 
More generally, an asymptotic relation 
f(x) ~ g(x) asx oo 
is equivalent to 
f(x) = g(x) + o(g(x)) as x — oo. 

We also note that f(x) — O(1) implies f(x) — o(x) as x — oo. 
Theorem 4.15 The relation 

(37) M(x) = o(x) asx — oo 


implies (x) ~ x as x > «c. 
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Pnoor. First we express y/(x) by a formula of the type 
(38) y(x)-x-— Y n(d)f (a) + O(1) 
E 
and then use (37) to show that the sum is o(x) as x > oo. The function f 
in (38) is given by 
f(n) = ao(n) — log n — 2C, 


where C is Euler's constant and o,(n) = d(n) is the number of divisors of n. 
To obtain (38) we start with the identities 


b= 3 W(x) = Y, Aln), i= xL] 


nsx nsx 


and express each summand as a Dirichlet product involving the Móbius 
function, 


i= Estee). Ain) = Y. ad) ton 7, [:]- Yu. 


din din 
A 2C| l 
P E 
n n 
= d =] — log- -— 2c} 
Eje" 


Y, u(a)co(q) — log q — 2C) 
des 


= » uld) f(a). 


qdsx 


Then 


[x] — vx) — 2C 


ll 
“a 


This implies (38). Therefore the proof of the theorem will be complete if we 
show that 
(39) p u(d) f(g) = o(x) asx oo. 

ne 


For this purpose we use Theorem 3.17 to write 


(40) — =Yu@f@= E une(=) + somf) — F(a)M(b) 


qd<x 
where a and b are any positive numbers such that ab = x and 


Fx) = } fin). 


nsx 
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We show next that F(x) = O(/x) by using Dirichlet's formula (Theorem 3.3) 


F co(n) = x log x + QC — 1)x + O( /x) 


nsx 


together with the relation 


¥ log n = log[x]! = x log x — x + O(log x). 


nsx 


These give us 


F(x) = Yoon) — Xlogn -2CY1 
nsx 


nsx nsx 
= x log x + (2C — 1)x + Ox) — (x log x — x + O(log x)) 
—2Cx + O(1) 
= O( /x) + O(log x) + O(1) = O(,/3). 
Therefore there is a constant B > 0 such that 
|F(x)| € B/x forall x > 1. 
Using this in the first sum on the right of (40) we obtain 


Yun) « By [s^m-5 
nsb n nzb V" 


Ja 
for some constant A > B > 0. 
Now let £ > 0 be arbitrary and choose a > 1 such that 


(41) 


Ack 
"E : 
Then (41) becomes 
(42) Y uwr(*) 3a 
nsb n 


for all x = 1. Note that a depends on £ and not on x. 
Since M(x) = O(x) as x — oo, for the same e there exists c > 0 (depending 


only on £) such that 


xe implies MOOI : 


*x' 
where K is any positive number. (We will specify K presently.) The second 
sum on the right of (40) satisfies 


d Z Ex _ex olf 


n 
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provided x/n > c for all n < a. Therefore (43) holds if x > ac. Now take 


_ r lfm 
git » n 
Then (43) implies 
(44) x. fi wm(= )| <ex provided x > ac. 
nsa 


The last term on the right of (40)'is dominated by 


|F(a)M(b)| < A/a |M(b)] < A /ab < eb ab = e /xb < ex 


provided that Jx 7 a, or x > a*. Combining this with (44) and (42) we 
find that (40) implies 


| » pld) fi «)| « 3ex 
adsx 


provided x > a? and x > ac, where a and c depend only on c. This proves (39). 
o 


Theorem 4.16 If 
Ae) = y A 
the relation 
(45) A(x) = o(1) asx — oo 


implies the prime number theorem. In other words, the prime number theorem 

is a consequence of the statement that the series 

un) 
ES n 

converges and has sum 0. 

Note. It can also be shown (see [3]) that the prime number theorem 
implies convergence of this series to 0, so (45) is actually equivalent to the 
prime number theorem. 

Pnoor. We will show that (45) implies M(x) = o(x). By Abel’s identity we 
have 


Mex) = Yu) EP 2 ce K A(t) dt, 


so 


Me) = At HI " A(0 dt. 
1 
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Therefore, to complete the proof it suffices to show that 
1f* 
(46) lim — f A(t) dt = 0. 
x=o X Ji 


Now if e > 0 is given there exists a c (depending only on £) such that | A(x)| 
< gif x > c. Since | A(x)| € 1 forall x > 1 we have 


l p LT c—-1. ax—c) 
= | A0dt| <|- | A - € -——4-———. 
3l 0) Ji pled ; [404 iene & if x 
Letting x — oo we find 
lim sup I f A(t) a| <e, 
xo |X v1 
and since e is arbitrary this proves (46). m| 


4.10 Brief sketch of an elementary proof of 
the prime number theorem 


This section gives a very brief sketch of an elementary proof of the prime 


number theorem. Complete details can be found in [31] or in [46]. The key 
to this proof is an asymptotic formula of Selberg which states that 


VGolog x 4- Y; ninw(*) = 2x log x + O(x). 
nsx 


The proof of Selberg’s formula is relatively simple and is given in the next 
section. This section outlines the principal steps used to deduce the prime 
number theorem from Selberg’s formula. 


First, Selberg’s formula is cast in a more convenient form which involves 
the function 


a(x) = e *(e*) — 1. 


Selberg’s formula implies an integral inequality of the form 
x cy 
(47) jot? <2 | [tol du dy 002. 
040 
and the prime number theorem is equivalent to showing that o(x) ^ 0 as 
x — co. Therefore, if we let 


C = lim sup|o(x)|, 


x> 


the prime number theorem is equivalent to showing that C = 0. This is 
proved by assuming that C > 0 and obtaining a contradiction as follows. 
From the definition of C we have 


(48) Jo(x)| < C + g(x), 
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where g(x) ^ 0 as x > ©. If C > 0 this inequality, together with (47), gives 
another inequality of the same type, 


(49) Jo(x)| < C + h(x), 


where 0 < C' < C and h(x) > 0 as x > ©. The deduction of (49) from (47) 
and (48) is the lengthiest part of the proof. Letting x > © in (49) we find that 
C < C', a contradiction which completes the proof. 


4.11 Selberg's asymptotic formula 


We deduce Selberg's formula by a method given by Tatuzawa and Iseki 
[68] in 1951. It is based on the following theorem which has the nature of an 
inversion formula. 


Theorem 4.17 Let F be a real- or complex-valued function defined on (0, co), 
and let 


G(x) = log x È, (2) 
nsx Ml 
Then 


F(x)log x + ae Am) = X wax(,). 
nsx M dsx d 
Proor. First we write F(x)log x as a sum, 


F(xlogx = X [ e; = PLOK Y ud). 


nsx 


Then we use the identity of Theorem zii; 
n 
A(n) = Y; adog 5 
am d 


to write 


x x n 
X (=) = Pu (5) » u(d)log d 


nsx 


Adding these equations we find 


i 
a 
A 
aM 
sl 
PETES 
LEE. 
be om d 
2 


F(9log x + YF (s aoe * + log d 
nsx MI n d 


In 


x ax 
F| = ju(d)log = 
LÀ ( pases 


Il 
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In the last sum we write n= qd to obtain 


y xr nes - unes: Y r(5)- iex). 


nsx d|n 


which proves the theorem. a 


"Theorem 4.18 Selberg’s asymptotic formula. For x > 0 we have 
V(x)log x + p A(n) W(2) = = 2x log x + O(x). 
Proor. We apply Theorem 4.17 to the function F,(x) = y(x) and also to 


F,(x) = x — C — 1, where C is Euler's constant. Corresponding to F, we 
have 


G,(x) = log x ©, o(2) = x log? x — x log x + O(log? x), 
nsx 


where we have used Theorem 4.11. Corresponding to F; we have 
x pa 
G(x) = lex Y (S) = log x Y E- om 1) 


-xlogxY +- (C + llogx Y 1 


nsx" nsx 


1 
=x log xia +C+ o(:)) — (C + 1)log x(x + O(1)) 
= x log? x — x log x + O(log x). 


Comparing the formulas for G,(x) and G;(x) we see that G,(x) — G(x) = 
O(log? x). Actually, we shall only use the weaker estimate 


G(x) — G(x) = 0/5). 


Now we apply Theorem 4.17 to each of F, and F, and subtract the two 
relations so obtained. The difference of the two right members is 


eoo) o). B) og 2) 00 


by Theorem 3.2(b). Therefore the difference of the two left members is also 
O(x). In other words, we have 


(o) - Ge - C- Dilegx e E v) 3 C "e rh siot 
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Rearranging terms and using Theorem 4.9 we find that 


iaei X «(Sj = (C= Dogx 


nsx 


coy, G zc iji + O(x) 


nsx MI 


= 2x log x + O(x). E 


Exercises for Chapter 4 


1. Let S = {1, 5,9, 13, 17, ...} denote the set of all positive integers of the form 
4n + 1. An element p of S is called an S-prime if p > 1 and if the only divisors of p, 
among the elements of S, are 1 and p. (For example, 49 is an S-prime.) An element 
n > Lin S which is not an S-prime is called an S-composite. 


(a) Prove that every S-composite is a product of S-primes. 
(b) Find the smallest S-composite that can be expressed in more than one way as a 


product of S-primes 
This example shows that unique factorization does not hold in S. 
2 Consider the following finite set of integers: 
T = (1,7, 11, 13, 17, 19, 23, 29}. 
(a) For each prime p in the interval 30 < p < 100 determine a pair of integers 
m, n, where m > 0 and n € T, such that p = 30m + n. 


(b) Prove the following statement or exhibit a counter example: 
Every prime p > 5 can be expressed in the form 30m 4- n, where m z 0 and 


neT J 
3. Let f (x) =x? +x + 41. Find the smallest integer x = Ofor which f(x)is composite. 
4. Let f(x) e ag + ax toc a,x" be a polynomial with integer coefficients, 
where a, > 0 and n > 1. Prove that f(x) is composite for infinitely many integers x. 


5. Prove that for every n > 1 there exist n consecutive composite numbers. 


6. Prove that there do not exist polynomials P and Q such that 


ECL 172:3,... 


7. Let ay «a; €: «a4 E X be a set of positive.integers such that no a; divides the 
product of the others. Prove that n € n(x). 
8. Calculate the highest power of 10 that divides 10001. 
9. Given an arithmetic progression of integers 
hh +k,h + 2k sh nh, oos 


where 0 < k < 2000. If h + nk is prime for n=t, t+ 1,...,t +r prove that 
r <9. In other words, at most 10 consecutive terms of this progression can be 


primes. 
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10. Let s, denote the nth partial sum of the series 
E 1 
i rr d) 


Prove that for every integer k > 1 there exist integers m and n such that Sm — s, 
= Ik 


11. Let s, denote the sum of the first n primes. Prove that for each n there exists an 
integer whose square lies between s, and s,, ,. 


Prove each of the statements in Exercises 12 through 16. In this group of 
exercises you may use the prime number theorem. 


12. If a > O and b > 0, then z(ax)/n(bx) ~ a/b as x > oo. 
13. If 0 < a < b, there exists an x, such that z(ax) < n(bx) if x > xo. 


14. If 0 < a < b, there exists an x such that for x > x there is at least one prime 
between ax and bx. 


15. Every interval [a,b] with 0 < a < b, contains a rational number of the form 
p/q, where p and q are primes. 


16. (a) Given a positive integer n there exists a positive integer k and a prime p such that 
10*n < p < 10*(n + 1). 
(b) Given m integers a,, ..., a, such that 0 < a; € 9 for i = 1,2,..., m, there exists 
a prime p whose decimal expansion has a, ..., a,, for its first m digits. 


17. Given an integer n > 1 with two factorizations n = [i= pi and n = Tifa: qis 
where the p; are primes (not necessarily distinct) and the q; are arbitrary integers 
>1. Let x be a nonnegative real number. i 
(a) If« > 1 prove that 


Der < X qi. 


(b) Obtain a corresponding inequality relating these sums if 0 < a < 1. 


18. Prove that the following two relations are equivalent: 


(a) n(x) => + e | 
log x log? x, 
©) x) = x + e) 
log x, 
19. If x > 2, let 


: * dt ; 
Li(x) = ji ET (the logarithmic integral of x). 
(a) Prove that 


x 5 * dt 2 
logx J, log?t log2’ 


Li(x) = 
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and that, more generally, 


gs LASS "Uk 
e. MC i 2 log* an) Se d t Cae 


where C, is independent of x. 


e) 
2 log t log" x 


(b) If x 2 2 prove that 


. Let f be an arithmetical function such that 


Y. fp)log p = (ax + b)log x + cx + O(1) forx > 2. 


psx 


Prove that there is a constant A (depending on f) such that, if x > 2, 


Y sO) = ax + (a + cj — sfa ge) + boston) + 4 (cc a 


psx 


Given two real-valued functions S(x) and T(x) such that 


T(x) = DEG ) for all x > 1. 
nsx 
If S(x) = O(x) and if c is a positive constant, prove that the relation 
S(x) ~ cx asx- o 
implies 
T(x) ~ cxlogx asx- oo. 


Prove that Selberg’s formula, as expressed in Theorem 4.18, is equivalent to each of 
the following relations: 


(a) Vlog x + Y Te os p = 2x log x + O(x). 
psx 

(b) S(x)log x + È Jog p = 2x log x + O(x). 
psx 


Let M(x) = Ynsx H(n). Prove that 


M(x)logx + Y (Sh) = 06) 


asx 
and that 
M(x)log x + p rc “oe p = O(x). 
[Hint: Theorem 4.17.] 
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24. Let A(x) be defined for all x > 0 and assume that 


To) = 3; a) = ax log. x + bx e 


a ) as x — oo, 
nsx 


where a and b are constants. Prove that 


Alxlogx+ Y aE = 2ax log x + o(x log x) asx- oo. 


nsx 
Verify that Selberg’s formula of Theorem 4.18 is a special case. 


25. Prove that the prime number theorem in the form V(x) ^ x implies Selberg's 
asymptotic formula in Theorem 4.18 with an error term o(x log x) as x — oo. 


26. In 1851 Chebyshev proved that if y(x)/x tends to a limit as x + oo then this limit 
equals 1. This exercise outlines a simple proof of this result based on the formula 


(50) X (2) = x log x + O(x) 
nsx 
which follows from Theorem 4.11. 
(a) Let 6 = lim sup(/(x)/x). Given £ > 0 choose N = N(é) so that x = N implies 


x+0 
W(x) € (ô + e). Split the sum in (50) into two parts, one with n < x/N, the 
other with n > x/N, and estimate each part to obtain the inequality 


X (2) < (5 + ex log x + x¥(N). 
asx 

Comparing this with (50), deduce that 5 > 1. 
(b) Let y = lim inf(J(x)/x) and use an argument similar to that in (a) to deduce that 


xo 


? < 1. Therefore, if y(x)/x has a limit as x — oo then y=ô=1. 


: In Exercises 27 through 30, let A(x) = „<x a(n), where a(n) satisfies 


(51) a(n) >0 for all n > 1, 
and 
x 
(52) > 4(2) = XH = axlogx + bx + o|[ —— | asx oo. 
nsx AH nsx n log x 


When a(n) = A(n) these relations hold witha = 1andb = —1. The following 
exercises show that (51) and (52), together with the prime number theorem, 
W(x) ~ x, imply A(x) ~ ax. This should be compared with Theorem 4.8 
(Shapiro’s Tauberian theorem) which assumes only (51) and the weaker 
condition 5^... A(x/n) = ax log x + O(x) and concludes that Cx « A(x) 
X Bx for some positive constants C and B. ` 


27. Prove that 


@) ED = 5 (Pn +5 (Sun + Ox) 
nsJ= n 


nsx MÀ n as/x 
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and use this to deduce the relation 
CLP DEBE 1 > x — 2a 
x -xlogxasys M 5 xiu" n are 
28. Let a = lim inf(A(x)/x) and let B = lim sup(A(x)/x). 


x70 


xo 


(a) Choose any e > 0 and use the fact that 
(2) «(B at and (2) «(c at 


for all sufficiently large x/t to deduce, from Exercise 27(b), that 


fae pe Eq 8S o; 
2.19732. 2 4 
Since c is arbitrary this implies 
p a 
a+ +z 22a. 
"dos 


[Hint: Let x > oo in such a way that A(x)/x > a] 
(b) By a similar argument, prove that 


x a 
B+5+55% 
and deduce that « = f = a. In other words, A(x) ~ ax as x ^ oo. 


29. Take a(n) = 1 + y(n) and verify that (52) is satisfied with a = 1 and b = 2C — 1, 
whare C is Euler's constant. Show that the result of Exercise 28 implies 


lim b Y aln) = 0. 


xm X nsx 
This givés an alternate proof of Theorem 444. 


30. Suppose that, in Exercise 28, we do not assume the prime number theorem. Instead, 
let 
y = lim nt, ó- Bi sup A. 
x9 X mm cod 
(a) Show that the argument suggested in Exercise 28 leads to the inequalities 
B aô a ay 
aee a, oped, 
vtt p B+5+><% 
(b) From the inequalities in part (a) prove that 
ay € a < f < aô. 
This shows that among all numbers a(n) satisfying (51) and (52) with a fixed a, 
the most widely separated limits of indetermination, 


lim inf 469 and lim sup AO 
x-o x-o x 
occur when a(n) = aA(n). Hence to deduce A(x) ~ ax from (51) and (52) it 
suffices to treat only the special case a(n) = a(n). 
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5.1 Definition and basic properties of 
congruences i 


Gauss introduced a remarkable notation which simplifies many problems 
concerning divisibility of integers. In so doing he created a new branch of 
number theory called the theory of congruences, the foundations of which are 
discussed in this chapter. 


Unless otherwise indicated, small latin and Greek letters will denote 
integers (positive, negative, or zero) 


Definition Given integers a, b 
b modulo m, and we write 


(1) 


,m with m > 0. We Say that a is congruent to 


a = b (mod m), 


if m divides the difference a — b. The number m is called the modulus of 
the congruence, 


In other words, the congruence (1) is equivalent to the divisibility relation 
m|(a — b). 
In particular, a = 0 (mod m) if, and only if, m|a. Hence a =b (mod m) if, 
and only if, a — b = 0 (mod m). If m ¥ (a — b) we write a # b (mod m) and 
say that a and b are incongruent mod m. 
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EXAMPLES 

1. 19 =7 (mod 12, 1 = —1 (mod 2, 3* = —1 (mod 5). 
2. n is even if, and only if, n = 0 (mod 2). 

3. n is odd if, and only if, n = 1 (mod 2). 

4. a = b (mod 1) for every a and b. 

5. Ifa = b (mod m) then a = b (mod d) when d|m, d > 0. 


The congruence symbol = was chosen by Gauss to suggest analogy with 
the equals sign =. The next two theorems show that congruences do indeed 
possess many of the formal properties of equations. 


Theorem 5.1 Congruence is an equivalence relation. That is, we have: 


(a) a =a (mod m) (reflexivity) 
(b) a = b (mod m) implies b = a (mod m) (symmetry) 
(c) a = b (mod m) and b = c (mod m) 

imply a = c (mod m) (transitivity). 


Pnoor. The proof follows at once from the following properties of divisi- 
bility: 
(a) m|0. 


(b) If m|(a — b) then m|(b — a). 
(c) If m|(a — b) and m|(b — c) then m|(a — b)+(b-—c)=a-c. o 


Theorem 5.2 Ifa = b (mod m) and a = ß (mod m), then we have: 


(a) ax + «y = bx + By (mod m) for all integers x and y. 

(b) aa = bf (mod m). 

(c) a" = b" (mod m) for every positive integer n. 

(d) f(a) = f(b) (mod m) for every polynomial f with integer coefficients. 


Pnoor. (a) Since m|(a — 5) and m|(x — f) we have 
m|x(a — b) + y(« — B) = (ax + ay) — (bx 4- By). 
(b) Note that ax — bB = ala — b) + b(x — f) = 0 (mod m) by part (a). 
(c) Takea =a and fj = b in part (b) and use induction on n. 
(d) Use part (c) and induction on the degree of f. o 


Theorem 5.2 tells us that two congruences with the same modulus can 
be added, subtracted, or multiplied, member by member, as though they were 
equations. The same holds true for any finite number of congruences with 
the same modulus. 

Before developing further properties of congruences we give two examples 
to illustrate their usefulness. 
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EXAMPLE 1 Test for divisibility by 9. An integer n > 0 is divisible by 9 if, 
and only if, the sum of its digits in its decimal expansion is divisible by 9. 


This property is easily proved using congruences. If the digits of n in decimal 
notation are do, à, ... , ap, then 


n — ao + 102, + 107a, +--- + 10*g,. 
Using Theorem 5.2 we have, modulo 9, 


10 z 1, 10? = 1,..., 10* 21 (mod 9) 


n= a + a; +---+ a, (mod 9), 
Note that all these congruences hold modulo 3 as well, so a number is 
divisible by 3 if, and only if, the sum of its digits is divisible by 3. 


EXAMPLE 2 The Fermat numbers F, = 2" + 1 were mentioned in the 
Historical Introduction. The first five are primes: 


Fo=3,  F,-5$ F,=17,  F,-25,  amdF,- 65,537. 
We now show that F; is divisible by 641 without explicitly calculating F;. 
To do this we consider the successive powers 2?" modulo 641. We have 


2224  2*-16, 28 = 256, 216 = 65,536 = 154 (mod 641), 
so 


2? = (154)? = 23,716 = 640 = —1 (mod 641). 
Therefore F; = 232 + 1 29 (mod 641), so Fs is composite. 
We return now to general properties of congruences. Common nonzero 
factors cannot always be cancelled from both members of a congruence as 
they can in equations. For example, both members of the congruence 


48 = 18 (mod 10) 
are divisible by 6, but if 


result, 8 = 3 (mod 10). Th 
be cancelled if the modulu: 


Theorem 5.3 [fc > 0 then 
=b (mod m) if, and only if, ac = be (mod mc). 


Pnoor. We have m|(b — a) if, and only if, cm|c(b — a). o 


cancellation law which c. 
e common factor, 


The next theorem describes a 


the modulus is not divisible by th anbe used when 
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Theorem 5.4 Cancellation law. If ac = bc (mod m) and if d = (m, c), then 


m 
a=b (moa z), 


In other words, a common factor c can be cancelled provided the modulus 
is divided by d = (m, c). In particular, a common factor which is relatively 
prime to the modulus can always be cancelled. 


Proor. Since ac = bc (mod m) we have 


mic 
m|c(a — b) s7 00-5 


But (m/d, c/d) = 1, hence m/d|(a — b). [s] 


Theorem 5.5 Assume a = b (mod m). If d|m and dļa then d|b. 


Pnoor. It suffices to assume that d > 0. If d|m then a = b (mod m) implies 
a = b (modd). But if d|a then a = 0 (modd) so b = 0 (mod d). o 


Theorem 5.6 Jf a = b (mod m) then (a, m) = (b, m). In other words, numbers 
which are congruent mod m have the same gcd with m. 

Pnoor. Let d — (a, m) and e — (b, m). Then d|m and d|a so d|b; hence d|e. 

Similarly, e|m, e|b, so e|a; hence e|d. Therefore d - e. ü 


Theorem 5.7 /f a = b (mod m) and if 0 < |b — a| < m, then a = b. 


Pnoor. Since m|(a — b) we have m < |a — b| unless a — b = 0. o 


Theorem 5.8 We have a = b (mod m) if, and only if, a and b give the same 
remainder when divided by m. 

Proor. Write a=mq+r, b=mQ +R, where 0xr«m and 

0 < R < m. Then a — b = r — R (mod m) and 0 < |r — R| < m. Now 

use Theorem 5.7. Oo 


Theorem 5.9 Jf a = b (mod m) and a = b (mod n) where (m, n) = 1, then 
a = b (mod mn). 
Proor. Since both m and n divide a — b so does their product since 
(m, n) = 1. oO 
5.2 Residue classes and complete residue 
systems 
Definition Consider a fixed modulus m > 0. We denote by 4 the set of all 
integers x such that x = a (mod m) and we call â the residue class a 
modulo m. 
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Thus, á consists of all integers of the form a + mq, where q = 0, +1, 
rs f 
^ The following properties of residue classes are easy consequences of this 
definition. 


Theorem 5.10 For a given modulus m we have: - 


(a) â = b if, and only if, a = b (mod m). 

(b) Two integers x and y are in the same residue class if, and only if, 
x = y (mod m). P 

(c) The m residue classes 1, 2, ... , ñ are disjoint and their union is the set 
of all integers. 


Pnoor. Parts (a) and (b) follow at once from the definition. To prove (c) we 
note that the numbers 0, 1, 2,..., m — 1 are incongruent modulo m (by 
Theorem 5.7). Hence by part (b) the residue classes 


Pa me —.. 
0, 1,2,...,m—1 


are disjoint. But every integer x must be in exactly one of these classes because 
x = qm-r where 0<r<m, so x =r (mod m) and hence x ef. Since 
0 = m this proves (c). QO 


Definition A set of m representatives, one from each of the residue classes 
1, 2, ..., ri, is called a complete residue system modulo m. 


EXAMPLES Any set consisting of m integers, incongruent mod m, is a complete 
residue system mod m. For example, 


(L2,...,m;  (612,...,m— 1}; 
{1,m + 2, 2m + 3, 3m + 4, ..., m?). 


Theorem 5.11 Assume (k, m) = 1. If {a,, 


: +++ Am} is a complete residue system 
modulo m, so is {kay,..., kan}. 


Proor. If ka; = ka; (mod m) then a; = a; (mod m) since (k, m) = I. 
Therefore no two elements in the set (ka,,. .., kam} are congruent modulo m. 
Since there are m elements in this set it forms a complete residue system. O 


5.3 Linear congruences 


Polynomial congruences can be studied in 
1 0 r much the same way that poly- 
nomial equations are studied in algebra. Here, however = deal p 


poly ae with integer coefficients so that the values of these poly- 
nomials will be integers when x is an integer. An i ead 
polynomial congruence E n integer x satisfying a 


Q) J(=) = 0 (mod m) 
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is called a solution of the congruence. Of course, if x = y (mod m) then 
f(x) = f(y) (mod m) so every congruence having one solution has in- 
finitely many. Therefore we make the convention that solutions belonging 
to the same residue class will not be counted as distinct. And when we speak 
of the number of solutions of a congruence such as (2) we shall mean the 
number of incongruent solutions, that is, the number of solutions contained 
in the set (1,2, ..., m) or in any other complete residue system modulo m. 
Therefore every polynomial congruence modulo m has at most m solutions. 


EXAMPLE | The linear congruence 2x = 3 (mod 4) has no solutions, since 
2x — 3 is odd for every x and therefore cannot be divisible by 4. 


EXAMPLE 2 The quadratic congruence x? = 1 (mod 8) has exactly four 
solutions given by x = 1, 3, 5, 7 (mod 8). 


The theory of linear congruences is completely described by the next three 
theorems. 


Theorem 5.12 Assume (a, m) = 1. Then the linear congruence 
(3) ax = b (mod m) 
has exactly one solution. 


Pnoor. We need only test the numbers 1, 2, ..., M, since they constitute a 
complete residue system. Therefore we form the products a, 2a, ...,ma. 
Since (a, m) — 1 these numbers also constitute a complete residue system. 
Hence exactly one of these products is congruent to b modulo m. That is, 
there is exactly one x satisfying (3). o 


Although Theorem 5.12 tells us that the linear congruence (3) has a unique 


solution if (a, m) = 1, it does not tell us how to determine this solution 
except by testing all the numbers in a complete residue system. There are 


more expeditious methods known for determining the solution; some of 


them are discussed later in this chapter. 
Note. If (a, m) = 1 the unique solution of the congruence ax = 1 (mod m) 


is called the reciprocal of a modulo m. Ifa’ is the reciprocal of a then ba’ is 
the solution of (3). 


Theorem 5.13 Assume (a, m) = d. Then the linear congruence 
(4) ax = b (mod m) 
has solutions if, and only if, d|b. 
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Proor. If a solution exists then d|b since d|m and dļa. Conversely, if d|b 


the congruence 
a b m 
=x=- d— 
aor (mo 7) 


has a solution since (a/d, m/d) = 1, and this solution is also a solution of (4). 


oO 
Theorem 5.14 Assume (a, m) = d and suppose that d|b. Then the linear 
congruence 
(5) ax = b (mod m) 


has exactly d solutions modulo m. These are given by 


m m m 
(6) MEL" cie Ma EE Ts, 


where t is the solution, unique modulo m/d, of the linear congruence 


a b m 
( ae my 
7) ax d (moa m) 


Proor. Every solution of (7) is also a solution of (5). Conversely, every 
solution of (5) satisfies (7). Now the d numbers listed in (6) are solutions of (7) 
hence of (5). No two of these are congruent modulo m since the relations 


LETT S Ue ST (mod m), withO<r<d0<s<d 


imply 


r5 = s (mod m), and hence r = s (mod d). 


But 0 < |r—s|<dsor=s. 

It remains to show that (5) has no solutions except those listed in (6). 
If y is a solution of (5) then ay = at (mod m) so y = t (mod m/d). Hence 
y = t + km/d for some k. But k = p (mod d) for some r satisfying 0 < r < d. 
Therefore 


m m 
ka =z (mod m) soyettre (mod m). 


Therefore y is congruent modulo m to one of the numbers in (6). This com- 
pletes the proof. : i 
In Chapter 1 we proved that the gcd of two numbers i i 
Chaj a and b is a linear 
combination of a and b. The same result can be deduced 
eae ed as a consequence of 
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Theorem 5.15 If (a, b) = d there exist integers x and y such that 
(8) ax + by — d. 


Pnoor. The linear congruence ax = d (mod b) has a solution. Hence there 
is an integer y such that d — ax = by. This gives us ax + by = d, as required. 


Note. Geometrically, the pairs (x, y) satisfying (8) are lattice points lying 
on a straight line. The x-coordinate of each of these points is a solution of the 
congruence ax = d (mod b). 


5.4 Reduced residue systems and the 
Euler-Fermat theorem 
Definition By a reduced residue system modulo m we mean any set of (rm) 
integers, incongruent modulo m, each of which is relatively prime to m. 


Note. (m) is Euler’s totient, introduced in Chapter 2. 


Theorem 5.16 If {a;, a2, . . . , pim} is a reduced residue system modulo m and 
if (k, m) = 1, then {kay, ka, . . . , kajq)) is also a reduced residue system 
modulo m. 


Proor. No two of the numbers ka; are congruent modulo m. Also, since 
(a;, m) = (k, m) = 1 we have (ka;, m) = 1 so each ka; is relatively prime 
to m. o 
Theorem 5.17 Euler-Fermat theorem. Assume (a, m) = 1. Then we have 


a®™ = 1 (mod m). 


Pnoor. Let {b;, b2, . - - , bom} be a reduced residue system modulo m. Then 
{ab,, ab2,..., Aboim} is also a reduced residue system. Hence the product of 
all the integers in the first set is congruent to the product of those in the 
second set. Therefore 


b, +++ bom = a b, -++ bom (mod m). 
Each b; is relatively prime to m so we can cancel each b; to obtain the theorem. 
Oo 
Theorem 5.18 If a prime p does not divide a then 
a^^! = 1 (mod p). 
Proor. This is a corollary of the foregoing theorem since g(p) = p — 1. O 
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Theorem 5.19 Little Fermat theorem. For any integer a and any prime p we 
have 
a? =a (mod p). 


Pnoor. If p J a this is Theorem 5.18. If p|a then both a? and a are congruent 
to 0 mod p. [z] 


The Euler-Fermat theorem can be used to calculate the solutions ofa 
linear congruence. 


Theorem 5.20 If (a, m) = 1 the solution (unique mod m) of the linear con- 
gruence 


(9) ax = b (mod m) 
is given by 
(10) x = ba?™-! (mod m). 


Pnoor. The number x given by (10) satisfies (9) because of the Euler-Fermat 
theorem. The solution is unique mod m since (a, m) = 1. 


EXAMPLE 1 Solve the congruence 5x = 3 (mod 24). 
Solution 
Since (5, 24) — 1 there is a unique solution. Using (10) we find 
x = 3.5929-! = 3.57 (mod 24) 
since (24) = @(3)p(8) = 2-4. Modulo 24 we have 5? = 1, and 
Ste 55 m], 525, so x = 15 (mod 24), 


EXAMPLE 2 Solve the congruence 25x = 15 (mod 120). 
Solution 

A Since d = (25, 120) = 5 and d|15 the congruence has exactly five solu- 
tions modulo 120. To find them we divide by 5 and solve the congruence 


Sx = 3 (mod 24). Using Example 1 and Theorem 5.14 we find that the five 
solutions are given by x = 15 + 24k,k = 0, 1, 2, 3, 4, or 


x = 15, 39, 63, 87, 111 (mod 120). 


5.5 Polynomial congruences modulo p. 
Lagrange’s theorem 


The fundamental theorem of algebra states that for every polynomial f 
of degree n > 1 the equation f(x) = 0 has n solutions among the complex 
numbers. There is no direct analog of this theorem for polynomial con- 
gruences. For example, we have seen that some linear congruences have no 
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solutions, some have exactly one solution, and some have more than one. 
Thus, even in this special case, there appears to be no simple relation between 
the number of solutions and the degree of the polynomial. However, for 
congruences modulo a prime we have the following theorem of Lagrange. 


Theorem 5.21 (Lagrange). Given a prime p, let 
S (x) = cg + CX +++ + 6.x" 
bea polynomial ofdegree n with integer coefficients such that c, # 0 (mod p). 
Then the polynomial congruence 
(11) f(x) = 0 (mod p) 


has at most n solutions. 


Note. This result is not true for composite moduli. For example, the 
quadratic congruence x? = 1 (mod 8) has 4 solutions. 


Proor. We use induction on n, the degree of f. When n = 1 the congruence 
is linear: 
c,X + Co = (mod p). 
Since c, # 0 (mod p) we have (c, p) = 1 and there is exactly one solution. 
Assume, then, that the theorem is true for polynomials of degree n — 1. 
Assume also that the congruence (11) has n + 1 incongruent solutions 
modulo p, say 
Xos Xis -+ -3 Xn» 


where f(x,) = 0 (mod p) for each k = 0, 1, ..., n. We shall obtain a contra- 
diction. We have the algebraic identity 


fG9) — f(xo) = Me — xo) = (x — Xo)g(x) 


=1 


where g(x) is a polynomial of degree n — 1 with integer coefficients and with 
leading coefficient c,. Thus we have 


f(x) — fx) = (x, — Xo)g(xx) = 0 (mod p), 


since f(x,) = f (Xo) = 0 (mod p). But x, — xo # 0 (mod p) if k # 0 so we 
must have g(x;) = 0 (mod p) for each k # 0. By this means that the con- 
gruence g(x) 2 0 (mod p) has n incongruent solutions modulo p, con- 
tradicting our induction hypothesis. This completes the proof. a 


5.6 Applications of Lagrange's theorem 


Theorem 5.22 If f(x) = co + cix t: + 6X isa polynomial of degree n 
with integer coefficients, and if the congruence 


f(x) = 0 (mod p) 
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has more than n solutions, where p is prime, then every coefficient of f is 
divisible by p. 


Proor. If there is some coefficient not divisible by p, let c, be the one with 
largest index. Then k < n and the congruence 


Co + cix +++ + Cx" = 0 (mod p) 
has more than k solutions so, by Lagrange’s theorem, p|c,, a contradiction. 
Il 


Now we apply Theorem 5.22 to a particular polynomial. 


Theorem 5.23 For any prime p all the coefficients of the polynomial 
f(x) = (x — 1)(x — 2)---(x— p + 1)— x”! 41 
are divisible by p. 
Pnoor. Let g(x) = (x — 1)(x — 2)---(x — p + 1). The roots of g are the 
numbers 1, 2,..., p — 1, hence they satisfy the congruence 
g(x) = 0 (mod p). 


By the Euler-Fermat theorem, these numbers also satisfy the congruence 
h(x) = 0 (mod p), where 


h(x) = x^^! — 1. 


The difference f(x) = g(x) — h(x) has degree p — 2 but the congruence 
f(x) = 0 (mod p) has p — 1 solutions, 1, 2,...,p — 1. Therefore, by 
Theorem 5.22, each coefficient of f (x) is divisible by p. 


Weobtain the next two theorems by considering two particular coefficients 
of the polynomial f(x) in Theorem 5.23. 


Theorem 5.24 Wilson's theorem. For any prime p we have 
(p — 1)!  —1 (mod p). 


Proor. The constant term of the 


iib Jiu polynomial f(x) in Theorem i 


Note. The converse of Wilson's theorem also holds. That is, if n > 1 and 
(n — 1)! = —1 (mod n), then n is prime. (See Exercise 5.1) 


Theorem 5.25 Wolstenholme's theorem. For any prime p > 5 we have 


P= N(p:—1)! 
y & ) 


= 2 
Ps = 0 (mod p°). 
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Proor. The sum in question is the sum of the products of the numbers 1, 
2,...,p — 1 taken p — 2 at a time. This sum is also equal to the coeffi- 
cient of —x in the polynomial 


g(x) = (x — I) — 2)---& — p + 1). 
In fact, g(x) can be written in the form 
g(x) = x^! — SyxP-? Sx — + Sp-3x? — Sp-2x + (p — D, 


where the coefficient S, is the kth elementary symmetric function of the roots, 
that is, the sum of the products of the numbers 1,2,..., p — 1, taken kata 
time. Theorem 5.23 shows that each of the numbers S,, S2,..-, S,-2 is 
divisible by p. We wish to show that S, ; is divisible by p?. 

The product for g(x) shows that g(p) — (p — 1)! so 


(p — D! e p^! Sip? + --- + Sap! — Sp-2P p — VI. 
Canceling (p — 1)! and reducing the equation mod p? we find, since p 7 5, 
PSp-2 = 0 (mod p°), 


and hence $,.., = 0 (mod p°), as required. Oo 


5.7 Simultaneous linear congruences. The 
Chinese remainder theorem 


A system of two or more linear congruences need not have a solution, even 
though each individual congruence has a solution, For example, there is no x 
which simultaneously satisfies x = 1 (mod 2) and x =0 (mod 4), even 
though each of these separately has solutions. In this example the moduli 2 
and 4 are not relatively prime. We shall prove next that any system of two or 
more linear congruences which can be solved separately with unique solu- 
tions can also be solved simultaneously if the moduli are relatively prime in 
pairs. We begin with a special case. 


Theorem 5.26 Chinese remainder theorem. Assume m,, . . . , m, are positive 
integers, relatively prime in pairs: 


(m,m)-1 ifiz k. 
Let b,, ..., b, be arbitrary integers. Then the system of congruences 


x = b, (mod m,) 


x = b, (mod m,) 


has exactly one solution modulo the product m, >>- m,. 
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> 


Proor. Let M = m, ---m,and let M, = M/[m,. Then (M,, m;) = 1 so each 
M, has a unique reciprocal M, modulo m,. Now let 


x = b,M,M1 b M; M5 + --- + b,M,M,. 
Consider each term in this sum modulo m,. Since M; = 0 (mod m,) if i # k 
we have 
x = b,M,M;, = b, (mod m}. 
Hence x satisfies every congruence in the system. But it is easy to show that the 
system has only one solution mod M. In fact, if x and y are two solutions of 


the system we have x = y (mod m,) for each k and, since the m, are relatively 
prime in pairs, we also have x = y (mod M). This completes the proof. C 


The following extension is now easily deduced. 
Theorem 5.27 Assume my, ...,m, are relatively prime in pairs. Let by, . . . , 
b, be arbitrary integers and let ay, ... , a, satisfy 
Gom) =1 fork =1,2,...,7. 
Then the linear system of congruences 


a,x = b, (mod m,) 


a,x = b, (mod m,) 
has exactly one solution modulo m,m; - - - m, 


ET 


Proor. Let a, denote the reciprocal of a, modulo m,. This exists since 
(a,, m,) = 1. Then the congruence a,x = b, (mod m,) is equivalent to the 
congruence x = b,a, (mod m,). Now apply Theorem 5.26. [e 


5.8 Applications of the Chinese remainder 
theorem 


The first application deals with polynomial congruences with composite 
moduli. 


Theorem 5.28 Let f be a polynomial with integer coefficients, let m,, m3, . . . , 
m, be positive integers relatively prime in pairs, and let m = mm; --- m,. 
Then the congruence 
(12) f(x) = 0 (mod m) 
has a solution if, and only if, each of the congruences 


(3) f(x) =0 (modm)  (i—12...,7) 
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has a solution. Moreover, if v(m) and v(m;) denote the number of solutions 
of (12) and (13), respectively, then 


(14) v(m) = v(m,)v(mz2) - - - v(m,). 


Pnoor. If f(a) = 0 (mod m) then f(a) = 0 (mod m;) for each i. Hence every 
solution of (12) is also a solution of (13). 

Conversely, let a; be a solution of (13). Then by the Chinese remainder 
theorem there exists an integer a such that 


(15) az a; (modm) fori — 1,2,...,r, 
so 
f(a) = f(a) = 0 (mod m). 


Since the moduli are relatively prime in pairs we also have f (a) = 0 (mod m). 
Therefore if each of the congruences in (13) has a solution, so does (12). 

We also know, by Theorem 5.26, that each r-tuple of solutions (a;, ..., a) 
of the congruences in (13) gives rise to a unique integer a mod m satisfying 
(15). As each a; runs through the v(m;) solutions of (13) the number of integers a 
which satisfy (15) and hence (13) is v(m,) - - - v(m,). This proves (14). [m] 


Note. If m has the prime power decomposition 
m= pi ++: Pe 


we can take m; = p; in Theorem 5.28 and we see that the problem of solving 
a polynomial congruence for a composite modulus is reduced to that for 
prime power moduli. Later we will show that the problem can be reduced 
further to polynomial congruences with prime moduli plus a set of linear 
congruences. (See Section 5.9.) 


The next application of the Chinese remainder theorem concerns the set 
of lattice points visible from the origin. (See Section 3.8.) 


Theorem 5.29 The set of lattice points in the plane visible from the origin 
contains arbitrarily large square gaps. That is, given any integer k > 0 
there exists a lattice point (a, b) such that none of the lattice points 


(at+r,b+s), 0<r<k,0<s<k, 
is visible from the origin. 


Pnoor. Let p,, p2, ..., be the sequence of primes. Given k > 0 consider 
the k x k matrix whose entries in the first row consist of the first k primes, 
those in the second row consist of the next k primes, and so on. Let m; be the 
product of the primes in the ith row and let M; be the product of the primes 
in the ith column. Then the numbers m; are relatively prime in pairs, as are 
the M;. 
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Next consider the set of congruences 
—1 (mod m) 
—2 (mod m;) 


x 
x 


x 2 —k (mod m}. 
This system has a solution a which is unique mod m, --- m,. Similarly, the 
system 

y = —1 (mod M,) 


= —k (mod M,) 


has a solution b which is unique mod M, --- M, = m; -++ m,. 
Now consider the square with opposite vertices at (a, b) and (a + k, b + k). 
Any lattice point inside this square has the form 


(arbos, where0<r<k0<s<k, 


and those with r = k or s = k lie on the boundary of the square. We now 
show that no such point is visible from the origin. In fact, 


= —r(modm, and b= —s (mod Mj) 


so the prime in the intersection of row r and column s divides both a + rand 
b + s. Hence a + r and b + s are not relatively prime, and therefore the 
lattice point (a + r, b + s) is not visible from the origin. 


5.9 Polynomial congruences with prime 
power moduli 
Theorem 5.28 shows that the problem of solving a polynomial congruence 
f(x) = 0 (mod m) 
can be reduced to that of solving a system of congruences 
f(x) = 0 (mod pë) ü= 1,2,...,7), 


where m = p,"!--- p,*. In this section we show that the problem can be 
further reduced to congruences with prime moduli plus a set of linear con- 
gruences. 

Let f bea polynomial with integer coefficients, and suppose that for some 
prime p and some « > 2 the congruence 


(16) f(x) = 0 (mod p°) 
has a solution, say x — a, where a is chosen so that it lies in the interval 


0<a<p* 
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This solution also satisfies each of the congruences f(x) = 0 (mod p^) for 
each f < a. In particular, a satisfies the congruence 


(17) f(x) = 0 (mod p*~'). 
Now divide a by p^! and write 
(18) a-qp' -r, whereO <r « p*^!. 


The remainder r determined by (18) is said to be generated by a. Since 
r =a (mod p*^!) the number r is also a solution of (17). In other words, 
every solution a of congruence (16) in the interval 0 < a < p* generates 
a solution r of congruence (17) in the interval 0 < r < p^ !. 

Now suppose we start with a solution r of (17) in the interval 0 < r < p*^! 
and ask whether there is a solution a of (16) in the interval 0 < a < p* which 
generates r. If so, we say that r can be lifted from p*~! to p*. The next theorem 
shows that the possibility of r being lifted depends on f(r) mod p* and on the 
derivative f'(r) mod p. 


Theorem 5.30 Assume « > 2 and let r be a solution of the congruence 
(19) f(x) = 0 (mod p*~*) 
lying in the interval 0 € r < p*-!, 


(a) Assume f'(r) 0 (mod p). Then r can be lifted in a unique way from 
D^! to p*. That is, there is a unique a in the interval 0 € a « p* which 
generates r and which satisfies the congruence 


(20) f(x) = 0 (mod p9. 


(b) Assume f'(r) = 0 (mod p). Then we have two possibilities: 
(bi) If f(r) = 0 (mod p°), r can be lifted from p*^! to p* in p distinct 
ways. 
(b2) If f(r) #0 (mod p°), r cannot be lifted from p^^! to p*. 


Pnoor. If n is the degree of f we have the identity (Taylor's formula) 


" (n) 
(21) f+ =f) + soo + Fp? 4... I1. 
for every x and h. We note that each polynomial f™(x)/k! has integer co- 
efficients. (The reader should verify this.) Now take x = r in (21), where r is a 
solution of (19) in the interval 0 < r < p*~', and let h = qp*^! where qisan 
integer to be specified presently. Since æ > 2 the terms in (21) involving h? 
and higher powers of h are integer multiples of p*. Therefore (21) gives us the 
congruence 


f(r + ap") = fr) + f'(r)ap^* (mod p9. 
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Since r satisfies (19) we can write f(r) = kp*^! for some integer k, and the 
last congruence becomes 


f(r ap^!) = {af '(r) + kp" * (mod p°). 


Now let 

(22) a=r+qp'. 

Then a satisfies congruence (20) if, and only if, q satisfies the linear congruence 
(23) qf'(r) + k = 0 (mod p). 


If f'(r) # 0 (mod p) this congruence has a unique solution q mod p, and if 
we choose q in the interval 0 € q « p then the number a given by (22) will 
satisfy (20) and will lie in the interval 0 < a < p*. 

On the other hand, if f'(r) = 0 (mod p) then (23) has a solution q if, 
and only if, p|k, that is, if and only if f(r) = 0 (mod p°). If p¥k there is no 
choice of q to make a satisfy (20). But if p|k then the p values B= sl, ovas 
p — 1 give p solutions a of (20) which generate r and lie in the interval 
0 <a < p*. This completes the proof. a 


The proof of the foregoing theorem also describes a method for obtaining 
solutions of congruence (20) if solutions of (19) are known. By applying the 


method repeatedly the problem is ultimately reduced to that of solving the 
congruence 


(24) f(x) = 0 (mod p). 


If (24) has no solutions, then (20) has no solutions. If (24) has solutions, 
we choose one, call it r, which lies in the interval 0 < r < p. Corresponding 
to r there will be 0, 1, or p solutions of the congruence 


(25) f(x) = 0 (mod p?) 
depending on the numbers f'(r) and k = f(r)/p. If p Y k and p|f'(r) then r 
cannot be lifted to a solution of (25). In this case we begin anew with a 


different solution r. If no r can be lifted then (25) has no solution. 
If p|k for some r, we examine the linear congruence 


af (r) + k = 0. (mod p). 
This has 1 or p solutions q according as p ¥ f'(r)or p| f(r). For each solution 
q the number a = r + qp gives a solution of (25). For each solution of (25) 
a similar procedure can be used to find all solutions of i 
f(x) = 0 (mod p°), 
and so on, until all solutions of (20) are obtained. 
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5.10 The principle of cross-classification 


Some problems in number theory can be dealt with by applying a general 
combinatorial theorem about sets called the principle of cross-classification. 
This is a formula which counts the number of elements of a finite set $ which 
do not belong to certain prescribed subsets S, ..., Sp- 


Notation Jf T is a subset of S we write N(T) for the number of elements of T. 
We denote by S — T the set of those elements of S which are not in T. Thus, 
S- S: 
i=1 
consists of those elements of S which are not in any of the subsets S,,..., Sn» 
For brevity we write S;S;, S;S;S,,..., for the intersections S; A Sj, 
Si^ Sj A S,,... , respectively. 


Theorem 5.31 Principle of cross-classification. Jf S4, . . . , S, are given subsets 
of a finite set S, then 


x(s- Us.) = mis) - Y NS)- Y NSS) 
iz1 isi 


Sisn lsi<jsn 


- X NSSS) + +++ + (—1)"N(S,S2 --- S,). 


1lsi<j<ksn 


Proor. If T € S let N,(T) denote the number of elements of T which are 
not in any of the first r subsets 5;,..., S,, with No(T) being simply N(T). 
The elements enumerated by N,.. ,(T) fall into two disjoint sets, those which 
are not in S, and those which are in S,. Therefore we have 


N,-1(T) = NT) + N,-1(TS,). 
Hence 
(26) NAT) = N,-,(T) — N,-4(TS,). 


Now take T = S and use (26) to express each term on the right in terms of 
N,-2. We obtain 


NS) = (N,-XS) — N--2(SS,-1)} — {N,-2(S,) — N,-2(S,S,—1)} 
= N,_2(S) — N,-2(S,-1) — N,-2(S,) + N,-2(S,S,- 1). 
Applying (26) repeatedly we finally obtain 


NS) = No(S) — -X No(S) + » NQ(S;Sj) — -++ + (-AYNé(S, --- S,). 
i=1 1<i<jsr 
When r = n this gives the required formula. o 
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EXAMPLE The product formula for Euler’s totient can be derived from the 
cross-classification principle. Let p,,...,p, denote the distinct prime 
divisors of n. Let S = {1, 2,..., n) and let S, be the subset of S consisting of 
those integers divisible by p,. The numbers in S relatively prime to n are those 
in none of the sets $,,..., Sp, SO 


oln) = (s - Us). 


If d|n there are n/d multiples of d in the set S. Hence 


N(S) =~, NS) = —,..., NS, S) = 
Di PiPj picco Pr” 


so the cross-classification principle gives us 


Lon n n 
y(n) =n + =+ (Ay 
p Pi Em PiPj ) picco Pr 
ud) ( ;) 
=n) —=n I =+]; 
Ban te 


The next application of the cross-classification principle counts the 
number of elements in a reduced residue system mod k which belong to a 
given residue class r mod d, where d|k and (r, d) = 1. 


Theorem 5.32 Given integers r, d and k such that d|k, d > 0, k = 1 and 
(r, d) = 1. Then the number of elements in the set 


S = (rt tdt = 1,2,..., k/d) 
which are relatively prime to k is q(k)/o(d). 


PROOF. If a prime p divides k and r + td then p ¥ d, otherwise p|r, contra- 
dicting the hypothesis (r, d) = 1. Therefore, the primes which divide k and 
elements of S are those which divide k but do not divide d. Call them p4, . . . , Pm 
and let 


K = pipa ot Pm: 
Then the elements of S relatively prime to k are those not divisible by any of 
these primes. Let 
S; = (x:xeS and pi|x) (1551. sah 


If x € S; and x =r + td, then r + td = 0 (mod pj). Since p; 7d there is a 
unique t mod p; with this property, therefore exactly one t in each of the 


intervals [1, pil, [p; + 1, 2p, ..., [(q — Dp: + 1, qp] where qp; = k/d. 
Therefore 


N(S) = En 
Pi 
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Similarly, 
k/d k/d 
N(S;S) = ——,..., N(S1 --- Sm) = — —. 
BA Y EE TEE 
Hence by the cross-classification principle the number of integers in S which 
are relatively prime to k is 


ac 
-D)-p tn) 2 9-m 2 


pid P, 


5.11 A decomposition property of reduced 
residue systems 


As an application of the foregoing theorem we discuss a property of reduced 
residue systems which will be used in a later chapter. We begin with a 
numerical example. 

Let S be a reduced residue system mod 15, say 


S = (1,2, 4, 7, 8, 11, 13, 14}. 


We display the 8 elements of S in a 4 x 2 matrix as follows: 


13 14 


Note that each row contains a reduced residue system mod 3, and the 
numbers in each column are congruent to each other mod 3. This example 
illustrates a general property of reduced residue systems described in the 
following theorem. 


Theorem 5.33 Let S be a reduced residue system mod k, and let d > 0 be a 
divisor of k. Then we have the following decompositions of S : 


(a) S is the union of q(k)/o(d) disjoint sets, each of which is a reduced residue 
system mod d. 

(b) S is the union of q(d) disjoint sets, each of which consists of q(k)/o(d) 
numbers congruent to each other mod d. 


Note. In the foregoing example, k — 15 and d — 3. The rows of the matrix 
represent the disjoint sets of part (a), and the columns represent the disjoint 
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sets of part (b). If we apply the theorem to the divisor d = 5 we obtain the 
decomposition given by the matrix 


Te) oc Sa} 
MPO ESI ER T 
Each row is a reduced residue system mod 5 and each column consists of 
numbers congruent to each other mod 5. 


Pnoor. First we prove that properties (a) and (b) are equivalent. If (b) holds 
we can display the ¢(k) elements of S as a matrix, using the q(d) disjoint sets 
of (b) as columns. This matrix has g(k)/g(d) rows. Each row contains a 
reduced system mod d, and these are the disjoint sets required for part (a). 
Similarly, it is easy to verify that (a) implies (b). 

Now we prove (b). Let S4 be a given reduced residue system mod d, and 
suppose r € S,. We will prove that there are at least (k)/e(d) integers n in S, 
distinct mod k, such that n = r (mod d). Since there are (d) values of r in 
S, and ¢(k) integers in S, there can't be more than €(k)/o(d) such numbers n, 
so this will prove part (b). 


The required numbers n will be selected from the residue classes mod k 
represented by the following k/d integers: 


nr dr-424,... r4. ka. 


d 
These numbers are congruent to each other mod d and they are incongruent 
mod k. Since (r, d) = 1, Theorem 5.32 shows that (k)/p(d) of them are 
relatively prime to k, so this completes the proof. (For a different proof based 
on group theory see [1].) o 


Exercises for Chapter 5 


1. Let S be a set of n integers (not necessaril 


y distinct). Prove that some nonempt 
subset of S has a sum which is divisible by onompiy, 


n. 
2. Prove that 5n? + 7n5 = 0 (mod 12) for all integers n. 


3. (a) Find all positive integers n for which n!3 = p (mod 1365). 


(b) Find all positive integers n for which n!7 = n (mod 4080). 


4. (a) Prove that ø(n) = 2 (mod 4) when n = 4 and when n = p°, where p is a prime, 
p = 3 (mod 4). 


(b) Find all n for which (n) = 2 (mod 4). 
5. A yardstick divided into inches is again divided into 70 ei 


among the four shortest divisions two have left endpoints 
19 inches. What are the right endpoints of the other two? 


qual parts. Prove that 
corresponding io 1 and 
6. Find all x which simultaneously satisfy the system of congruences 


x = 1 (mod 3), x = 2 (mod 4), x = 3 (mod 5). 
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7. Prove the converse of Wilson's theorem: If(n — 1)! + 1 = 0 (mod n), then nis prime 
ifn > 1. 
8. Find all positive integers n for which (n — 1)! + 1 is a power of n. 


10. 


11. 


12 


13. 


14. 


15. 


16. 


17. 


. If p is an odd prime, let q = (p — 1)/2. Prove that 


(q!)? + (—1) = 0 (mod p). 


This gives q! as an explicit solution to the congruence x? + 1 = 0 (mod p) when 
p = 1 (mod 4), and it shows that g! = +1 (mod p) if p = 3 (mod 4). No simple 
general rule is known for determining the sign. 


If p is odd, p > 1, prove that 


12325? ...(p — 2}? = (- 1)** ? (mod p) 


and 
27476? ...(p — 1)? = (—1)^* 9? (mod p). 
Let p be a prime, p > 5, and write 
DM c MAD 
2:043 p ps 


Prove that p? |(r — s). 


If p is a prime, prove that 


-Eeen 
°() 


Let a, b, n be positive integers such that n divides a" — b". Prove that n also divides 
(a" — b")/(a — b). 


Also, if p*|[n/p] prove that 


Let a, b, and x, be positive integers and define 
x,—ax,-i +b forn=1,2,... 
Prove that not all the x, can be primes. 


Let n, r, a denote positive integers. The congruence n? (mod 10°) implies 
n' = n (mod 10°) for all r. Find all values of r such that n” = n (mod 10°) implies 
n? = n (mod 10%). 


Let n, a, d be given integers with (a, d) = 1. Prove that there exists an integer m 
such that m = a (mod d) and (m, n) = 1. 


Let f be an integer-valued arithmetical function such that 


f(m + n) = f(n) (mod m) 
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for all m > 1, n > 1. Let g(n) be the number of values (including repetitions) of 


f(1), f), . ... f (n) divisible by n, and let h(n) be the number of these values relatively 
prime to n. Prove that 


hn) = n Y. a) © 


mi M 
18. Given an odd integer n > 3, let k and t be the smallest positive integers such that 


both kn + 1 and tn are squares. Prove that n is prime if, and only if, both k and t 
are greater than n/4. 


19. Prove that each member of the set of n — 1 consecutive integers 


n!-2,n! -3,..., n! 3n 
is divisible by a prime which does not divide any other member of the set. 


20. Prove that for any positive integers mand k, there exists a set of n consecutive integers 
such that each member of this set is divisible by k distinct prime factors no one of 
which divides any other member of the set. 


21. Let n be a positive integer which is not a Square. Prove that for every integer a 
relatively prime to n there exist integers x an 


d y satisfying 
ax = y (mod n) with 0 < x < /nand0 < lyl < Jn. 
22. Let p bea prime, p = 1 (mod 4), let q = (p — 1)/2, and let a = qi. 
(a) Prove that there exist positive inte; 


gers x and y satisfying 0 < x < Jp and 
O<y< Vp such that 


ax? — y? = 0 (mod p). 
(b) For the x and y in part (a), prove that p — 


p = 1 (mod 4) is the sum of two Squares. 


x? + y?, This shows that every prime 
(c) Prove that no prime p = 


3 (mod 4) is the sum of two Squares. 
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Finite Abelian Groups and 
Their Characters 


6.1 Definitions 


In Chapter 2 we had occasion to mention groups but made no essential use 
of their properties. Now we wish to discuss some elementary aspects of group 
theory in more detail. In Chapter 7 our discussion of Dirichlet’s theorem on 
primes in arithmetical progressions will require a knowledge of certain 
arithmetical functions called Dirichlet characters. Although the study of 
Dirichlet characters can be undertaken without any knowledge of groups, 
the introduction of a minimal amount of group theory places the theory of 
Dirichlet characters in a more natural setting and simplifies some of the 
discussion. 


Definition Postulates for a group. A group G is a nonempty set of elements 
together with a binary operation, which we denote by -, such that the 
following postulates are satisfied: 


(a) Closure. For every a and b in G, a - b is also in G. 

(b) Associativity. For every a, b, c in G, we have (a-b)-c = a- (bc). 

(c) Existence of identity. There is a unique element e in G, called the 
identity, such that a -e = e -a = a for every a in G. 

(d) Existence of inverses. For every a in G there is a unique element b in G 
such that a -b = b -a = e. This b is denoted by a^! and is called the 


inverse of a. 


Note. We usually omit the dot and write ab for a. b. 


Definition Abelian group. A group G is called abelian if every pair of elements 
commute; that is, if ab — ba for all a and b in G. 
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i ite if Gi i In this 

iti inite group. A group G is called finite if G is a finite set. ; 

Dee d xd EEAS in G is called the order of G and is denoted 
by |G}. 


Definition Subgroup. A nonempty subset G' of a group G which is itself a 
group, under the same operation, is called-a subgroup of G. 


6.2 Examples of groups and subgroups 


EXAMPLE 1 Trivial subgroups. Every group G has at least two subgroups, 
G itself and the set (e) consisting of the identity element alone. 


EXAMPLE 2 Integers under addition. The set of all inte: 


gers is an abelian group 
with + as the operation and 0 as the identity. The in 


verse of n is — n. 


ExAMPLE 3 Complex numbers under multiplication. The set of all non-zero 
complex numbers is an abelian group with ordinary multiplication of 
complex numbers as the operation and 1 as the identity. The inverse of z is the 


reciprocal 1/z. The set of all complex numbers of absolute value 1 is a 
subgroup. 


EXAMPLE 4 The nth roots of unity. The groups in Examples 2 and 3 are 
infinite groups. An example of a finite group is the set (1, e 2, .., g-1), 
where e = e^"! and the operation - is ordinary multiplication of complex 


numbers. This group, of order n, is called the group of nth roots of unity. It isa 
subgroup of both groups in Example 3. 


6.3 Elementary properties of groups 


The following elementary theorems concer: 


n an arbitrary group G. Unless 
otherwise stated, G is not required to be ab 


elian nor finite, 


Theorem 6.1 Cancellation laws. If elements a, b, c in G satisfy 


ac=be or ca= cb, 
thena = b. 


Pnoor. In the first case multiply each member o 


n the right by c~! and use 
associativity. In the second case multiply on the 


left by c^ !. [m] 
Theorem 6.2 Properties of inverses. In any group G we have: 

(a) e! =e. 

(b) For every a in G, (a^)! = a. 

(c) For all a and b in G, (ab)! = b7!a7!. (Note reversal of order.) 


(d) For all a and b in G the equation ax — b has the unique solution x — 
a^ !b; the equation ya = b has the unique solution y = ba- 1, 
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PROOF. 


(a) Since ee = ee~' we cancel e to obtain e = e 
(b) Since aa^! = e and inverses are unique, a is the inverse of a^ +. 
(c) By associativity we have 


(ab)(b^ ta~!) = a(bb~!)a~! = aea^! = aa^! = e 


so b~ ta~! is the inverse of ab. 
(d) Again by associativity we have 


a(a~'b) = (aa^!) — b and (ba™')a = b(a^!a) = b. 


The solutions are unique because of the cancellation laws. Oo 


pd 


Definition Powers of an element. If a e G we define a" for any integer n by the 
following relations: 


a9 


=e a -gqqa" !,  a"-(a'!y forn>0. 
The following laws of exponents can be proved by induction. We omit 
the proofs. 


Theorem 6.3 If a € G, any two powers of a commute, and. for all integers m and n 
we have 


a™a" = q"*^ = a^a" and (ay = qm = (a")". 
Moreover, if a and b commute we have 
a"b" = (aby". 


Theorem 6.4 Subgroup criterion. If G' is a nonempty subset of a group G, then 
G' is a subgroup if, and only if, G' satisfies group postulates (a) and (d): 
(a) Closure: If a, b e G', then ab e G'. 
(d) Existence of inverse. If a € G', then a^! € G'. 


Proor. Every subgroup G' certainly has these properties. Conversely, if G' 
satisfies (a) and (d) it is easy to show that G' also satisfies postulates (b) and (c). 
Postulate (b), associativity, holds in G' because it holds for all elements in G. 
To prove that (c) holds in G' we note that there is an element a in G' (since G' is 
nonempty) whose inverse a^! € G' (by (d)) hence aa^! € G' by (a). But aa^! 
=esoeeG. imi 


6.4 Construction of subgroups 


A subgroup of a given group G can always be constructed by choosing any 
element a in G and forming the set of all its powers a", n = 0, +1, +2... 
This set clearly satisfies postulates (a) and (d) so is a subgroup of G. It is called 
the cyclic subgroup generated by a and is denoted by <a). 
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i i if Gi . If a" = e for some positive 
that (a) is abelian, even if G is not. 
one ae will be a smallest n > 0 with this property and the subgroup 
a) will be a finite group of order n, 


<a) = (a;a^,...,a* a" =e}. 


The integer n is also called the order of the element a. An example of a cyclic 
subgroup of order n is the group of nth roots of unity mentioned in Section 
6.2. 


The next theorem shows that every element of a finite group has finite 
order. 


Theorem 6.5 If G is finite and a € G, then there is a positive integer n < |G| 
such that a" — e. 


Pnoor. Let g — |G|. Then at least two of the following g + 1 elements of G 
must be equal: 


e, a, a?, ...,a*. 


Suppose that a’ = a', where 0 < s < r < g. Then we have 


e-a(a)!-aq-. 


This proves the theorem with n = r — s. 


oO 


As noted in Section 6.2, every group G has two trivial subgroups, {e} and 


G itself. When G is a finite abelian group there is a simple process for con- 
structing an increasing collection of sub 


groups intermediate to {e} and G. 
The process, which will be described in Theorem 6.8, is based on the following 
observation. 


If G' isa subgroup ofa finite group G, then for any element a in G there is an 
integer n such that a" e G'. If a is already in G' we simply take n — 1. If 
a € G' we can taken to be the order of a, since a^ = e € G'. However, there may 
bea smaller positive power ofa which lies in G'. Bythe well-ordering principle 
there is a smallest positive integer n such that a" € G'. We call this integer the 
indicator of a in G'. 


Theorem 6.6 Let G' be a subgroup of a finite abelian group G, where G' # G. 
Choose an element a in G, a ¢ G', and let h be the indicator of a in G'. Then 
the set of products 


G" = (xa*:xx eG'andk = 0,1,2,,..,h — 1} 


is a subgroup of G which contains G'. 


Moreover, the order of G" 
that of G', 


is h times 
IG"| = h|G'|. 
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Proor. To show G” is a subgroup we use the subgroup criterion. First we 
test closure. Choose two elements in G", say xa* and yaj, where x, y e G' and 
0<k<h,0 < j< h. Since Gis abelian the product of the elements is 


(1) (xy)a*7. 
Now k + j = qh + r where 0 <r < h. Hence 
atti = a+" = ata" = za’, 

where z = a™ = (a")?eG’ since a* € G'. Therefore the element in (1) is 
(xyz)a" = wa’, where w € G' and 0 <r « h. This proves that G" satisfies the 
closure postulate. 

Next we show that the inverse of each element in G" is also in G". Choose 
an arbitrary element in G”, say xa*. If k = O then the inverse is x^! which is in 
G". If 0 < k < h the inverse is the element 


ya'-*, where y = x^ (a^ !, 


which again is in G". This shows that G" is indeed a subgroup of G. Clearly 
G" contains G'. 

Next we determine the order of G". Let m — |G'|. As x runs through the 
m elements of G' and k runs through the h integers 0, 1, 2,..., h — 1 weobtain 
mh products xa*. If we show that all these are distinct, then G" has order mh. 
Consider two of these products, say xa* and ya and assume that 


xa = ya! withO<j<k<h. 
Then a~} = x^!yand0 < k — j < h. Since x~ ty e G' we must have a‘~/ in 


G' so k = j and hence x = y. This completes the proof. 


6.5 Characters of finite abelian groups 


Definition Let G be an arbitrary group. A complex-valued function f defined 
on G is called a character of G if f has the multiplicative property 


f(ab) = f(a) f(b) 
for all a, b in G, and if f(c) # 0 for some c in G. 
Theorem 6.7 If f is a character of a finite group G with identity element e, then 


f(e) = 1 and each function value f (a) is a root of unity. In fact, if a" = e then 


f(ay = 1. 
Pnoor. Choose c in G such that f(c) # 0. Since ce = c we have 
SOF) = flc) 
so f(e) = 1. Ifa" = e then f(a} = f(a") = fe) = 1. o 
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ly the function which 
XAMPLE Every group G hasat least one character, name ly 

E identically 1 on G. This is called the principal character. The next theorem 
tells us that there are further characters if G is abelian and has finite order > 1. 


Theorem 6.8 A finite abelian group G of order n has exactly n distinct characters. 


Proor. In Theorem 6.6 we learned how to construct, from a given subgroup 
G' + G, a new subgroup G” containing G' and at least one more element a 


not in G'. We use the symbol (G'; a) to denote the subgroup G" constructed 
in Theorem 6.6. Thus 


(G5) = {xa*:x e G' and0 < k < h} 


where h is the indicator of a in G'. 

Now we apply this construction Tepeatedly, 
{e} which we denote by G,. IfG, # Gweleta, 
e and define G, = ¢G,; a,>.IfG, # G let d; 
in G, and define G3 = (G5; a5». Continue 
set of elements a,, a, +++,@, and a corres 
G2,..., G41 such that 


Starting with the subgroup 
be an element of G other than 
be an element of G which is not 
the process to obtain a finite 
ponding set of subgroups G,, 


G41 = (Gr; a) 


ps since the given group 
than its predecessor G,. 
ain of subgroups and Prove the theorem by induction, 
ue for G, it must also be true for G,, ,. 


. Itise namely the function which 
is identically 1. Assume, therefore, that G, has order m and that there are 


» that is, the smallest positi 


here are exactly h different Ways to extend each 
character of G 


c ) r+1 and that each character of 
G,.., is the extension of some character of r- This will prove that G,, , has 
exactly mh characters, and since mhis also the order of G,, , this will prove the 
theorem by induction on r. : 


A typical element in G,+1 has the form 
xa^ where xe G,and0<k < h. 


Suppose for the moment that it i 
G, +1. Call this extension fand let 
multiplicative property requires 


F(a) = f()(ay. 


S possible to extend a ch; 


aracter f of G, to 
us see what can be said 


about f(xa,"), The 
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But x € G, sof (x) = f(x) and the foregoing equation implies 


fea) = fo f(a. 


This tells us that f (xa,*) is determined as soon as J (a,) is known. 
What are the possible values for f (a)? Let c = a,". Since c e G, we have 
f(c) = f(c), and since f is multiplicative we also have f(c) = f (a). Hence 


Ja} = f(o) 
so f (a,) is one of the hth roots of f (c). Therefore there are at most h choices 
for f (a,). 
These observations tell us how to define f. If f is a given character of G,, 


we choose one of the hth roots of f (c), where c = a,", and define f (a,) to be 
this root. Then we define fon the rest of G,,, by the equation 


Q) fa) = fe9 (a). 


The h choices for f(a,) are all different so this gives us h different ways to 
define f (xa,"). Now we verify that the function f'so defined has the required 
multiplicative property. From (2) we find 


JG + ya) = f(xy a) = fox) a)? 

= SHAS VTT (a) 

= faf (a). 
so Jis a character of G,,. No two of the extensions fand g can be identical 
on G,,, because the functions f and g which they extend would then be 
identical on G,. Therefore each of the m characters of G, can be extended in h 
different ways to produce a character of G, + ,. Moreover, if ọ is any character 
of G,+ı then its restriction to G, is also a character of G,, so the extension 
process produces all the characters of G,,,. This completes the proof. O 


6.6 The character group 


In this section G is a finite abelian group of order n. The principal character 
of G is denoted by fı. The others, denoted by f2, f5, ..., fn, are called non- 
principal characters. They have the property that f(a) # 1 for some a in G. 


Theorem 6.9 If multiplication of characters is defined by the relation 
(fifa) = fia) fa) 


for each a in G, then the set of characters of G forms an abelian group of 
order n. We denote this group by G. The identity element of G is the principal 
character f. The inverse of f; is the reciprocal 1/f,. 


Pnoor. Verification of the group postulates is a straightforward exercise and 
we omit the details. 
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Note. For each character f we have | f(@)| = 1. Hence the reciprocal 
1/f (a) is equal to the complex conjugate f (a). Thus, the function J defined by 
f(a) = f(a) is also a character of G. Moreover, we have 

z 1 " 
a) = —— = 
Sla) Ta f(a^?) 


for every a in G. 


6.7 The orthogonality relations for characters 


Let G be a finite abelian group of order n with elements a,, a;, ... > ân, and 
let fisfo,-.., f, be the characters of G, with f, the principal character. 


Notation We denote by A= A(G) then x n matrix [a;;] whose element dij in 
the ith row and jth column is 


dij — Fifa). 
We will prove that the matrix A 


to deduce the so-called orthogonali 
determine the sum of the entries in ea 


has an inverse and then use this fact 


ty relations for Characters. First we 
ch row of A. 


Theorem 6.10 The sum of the entries in the ith TOW of A is given by 


a _ J^ if f; is the principal character (i = 1), 
2 fla) = lo otherwise, 


PROOF. Let S denote the S 


and $ — n. If f, f, 
runs through the elem 


S= X fba) = AÈ fla) = fios. 


Therefore S(1 — Sib)) = 0. Since Sb) # Lit follows that § = 0. 


Oo 
Now we use this theorem to show that A has an inverse, 
Theorem 6.11 Let 4* denote the Conjugate transpose of the matrix A. Then we 
have 


AA* =n], 
where I is then x n identity matrix. Hence n- 


'A* is the inverse of A. 
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Proor. Let B = AA*. The entry b; in the ith row and jth column of B is 
given by 


by = Y. Had Fa) = EGI) = Y fla 
where f, = n = f/f; Now f/f; = fi if, and only if, i =j. Hence by 


Theorem 6.10 we have 
n ifi-j, 
b= 
x lb if i # j. 


In other words, B = nl. o 


Next we use the fact that a matrix commutes with its inverse to deduce the 
orthogonality relations for characters. 


Theorem 6.12 Orthogonality relations for characters. We have 
n n if a; =a; 
3 ) f(a) = pi 
(3) 2 Fai) fAa;) lo il aita: 
Pnoor. The relation AA* = nI implies A*A = nI. But the element in the ith 
row and jth column of A*A is the sum on the left of (3). This completes the 
proof. oO 


Note. Since f(a;)) = fí(a) ' = J,(a;~'), the general term of the sum in (3) 
is equal to f,(a;—*) f,(aj) = f(a; ‘a;). Therefore the orthogonality relations 


can also be expressed as follows: 
LI ips pd if a; = aj, 
2, Sara) = le if a; # aj. 


r=1 


When a; is the identity element e we obtain: 


Theorem 6.13 The sum of the entries in the jth column of A is given by 


n ifaj—e, 


(4) 2. fa) n » otherwise. 


6.8 Dirichlet characters 


The foregoing discussion dealt with characters of an arbitrary finite abelian 
group G. Now we specialize G to be the group of reduced residue classes 
modulo a fixed positive integer k. First we prove that these residue classes do, 
indeed, form a group if multiplication is suitably defined. 

We recall that a reduced residue system modulo k is a set of q(k) integers 
(a1, 2; - - -» dou) incongruent modulo k, each of which is relatively prime to 
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k. For each integer a the corresponding residue class â is the set of all integers 
congruent to a modulo A: 
à = (x: x =a (mod K)). 
Multiplication of residue classes is defined by the relation 
(5) â- b = ab. 


That is, the product of two residue classes â and b is the residue class of the 
product ab. 


Theorem 6.14 With multiplication defined by (5), the set of reduced residue 
classes modulo k is a finite abelian group of order ok). The identity is the 
residue class 1. The inverse of á is the residue class b where ab = 1 (mod k) 


Proor. The closure property is automatically Satisfied because of the way 
multiplication of residue classes was defined. The class 1 is clearly the identity 
element. If (a, k) = 1 there is a unique b such that ab = 1 (mod k). Hence 


the inverse of à is b. Finally, it is clear that the group is abelian and that its 
order is q(k). D 


Definition Dirichlet characters, Let G be the group of reduced residue 


classes modulo k. Corresponding to each character f of G we define an 
arithmetical function y = Xy as follows: 


xn) = f(A) if(n k) = 1, 
x(n) =0 if (n, k) > 1. 


The function y is called a Dirichlet character modulo k. The principal 
character y, is that which has the properties 


_ fl if, =1, 
nto) =f) if(n, k) > 1, 


Theorem 6.15 There are ok) 


which is completely multip| 
have 


(6) 
and 


distinct Dirichle| 


d t characters modulo k, each of 
licative and peri 


odic with period k. That is, we 


x(mn) = x(m)x(n) for all m,n 


x(n + k) = x(n) for all n. 
Conversely, if y is completely 
and if (n) = 0 if (n, k) 1, th 
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members of (6) are zero. The periodicity property follows from the fact that 
x(n) = f(ñ) and that a = b (mod k) implies (a, k) = (b, k). 
To prove the converse we note that the function f defined on the group 
G by the equation 
fâ) = x(n) if(nk)=1 


is a character of G, so x is a Dirichlet character mod k. o 


ExAMPLE When k= 1 or k =2 then g(k) =1 and the only Dirichlet 
character is the principal character y,. For k > 3, there are at least two 
Dirichlet characters since q(k) > 2. The following tables display all the 
Dirichlet characters for k — 3, 4 and 5. 


n l 2 4 ws 

xin 1 1 1 10 

m ud 2,3 ud 2 34 y(n) 1 -1 -1 10 

xim 1 1 0 xn) 1 0 1.0 xam 1 i -i -1 0 

y(n) 1 -1 0 xYn 1 0 —1 0 Xn 1 -—i i -1 0 
k = 3, p(k) =2 k = 4, g(k) =2 k= 5, o(k) =4 


To fill these tables we use the fact that y(n)” = 1 whenever (n, k) = 1, 
so x(n) is a p(k)th root of unity. We also note that if y is a character mod k so is 
the complex conjugate 7. This information suffices to complete the tables for 
k =3andk = 4. 

When k = 5 we have (5) = 4 so the possible values of x(n) are + 1 and +i 
when (n, 5) = 1. Also, x(2)x(3) = X(6) = x(1) = 1 so x(2) and (3) are re- 
ciprocals. Since x(4) = x(2)? this information suffices to fill the table for 
k = 5. Asa check we can use Theorems 6.10 and 6.13 which tell us that the 
sum of the entries is 0 in each row and column except for the first. The follow- 
ing tables display all the Dirichlet characters mod 6 and 7. 


n 1234 5 6 n 3 2 3 4 5 6 7 

OPIDO OUR ERR NN MIT. 7. i 

xin 1000 -I 0 xxn) 1 1 -1 1'-—p o «1340 
k = 6, p(k) = 2 wa) 1 o o -o -o -10 og 

X 1. o? —o —-o o 10 

x(n) 1 —o o o —w 10 

x(m 1 —o —o? c? ow -10 


k-7,9() =6 
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In our discussion of Dirichlet’s theorem on primes in an arithmetic 


progression we shall make use of the following orthogonality relation 
for characters modulo k. 


Theorem 6.16 Let 7;,..., Xo denote the olk) Dirichlet characters modulo k. 
Let m and n be two integers, with (n, k) = 1. Then we have 


S zü)- 909 if mzn (mod &), 
Xem ie i, if m n (mod k). 


Proor. If (m, k) = 1 take 4; = fi and a; = ri in the orthogonality relations 
of Theorem 6.12 and note that =ñ if, 


> and only if, m =n (mod k). If 
(m, k) > 1 each term in the sum vanishes and m € n (mod k). Li 


6.9 Sums involving Dirichlet characters 


This section discusses certain su 
theorem on primes in arithmetic 

The first theorem refers to 
proof is also valid if X is any 
period k and has bounded parti. 


ms which occur in the proof of Dirichlet's 
al progressions. 


a nonprincipal character x mod k, but the 

arithmetical function that is periodic with 

al sums. 

Theorem 6.17 Let X be any non 
nonnegative function which 
all x > xo. Then if y > x > 


(7). 


principal character modulo k, and let f be a 
has a continuous negative derivative f'(x) for 
Xo we have 


Y, xn) f(r) = O(F (x). 


If, in addition, f (x) + 0 as x > 96, then the infinite series 


D xo)f (n) 


converges and we have, for x > Xo, 


(8) LHF) = Y infin) oti 


_PRoor. Let A(x) = De x(n). Since y is nonprincipal we have 


k 
A(k) = Y y(n) = 0. 
n=1 


By periodicity it follows that A(nk) = 
for all x. In other words, A(x) = O(1). 
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Oforn = 2,3, hence | A(x)| < p(k) 


6.10: The nonvanishing of L(1, y) for real nonprincipal y 
Now we use Abel’s identity (Theorem 4.2) to express the sum in (7) as an 
integral. This gives us 


Y, x00f() = f0)40) — SAS) — [aor ‘dt 


x<nsy 
= OO) + O(f G9) + of [ero a) = o(s). 
This proves (7). If f(x) > 0 as x + oo then (7) shows that the series 
OO) 


converges because of the Cauchy convergence criterion. To prove (8) we 
simply note that 


Y aod soy = Efo) lim Son 


y-o x<n 


Because of (7) the limit on the right is O(f(x)). This completes the proof. O 


Now weapply Theorem 6.17 successively with f(x) = 1/x, f(x) = (log x)/x, 
and f(x) = 1/,/x for x > 1 to obtain: 


Theorem 6.18 If y is any nonprincipal character mod k and if x > 1 we have 


x) _ S xm  o(1 
0 iei. 
l 2 y(n) l 
(10) z = ee X = um of = 3! 


xm) _ & xn 1 
(11) aman ue «e ) 
mom du Vx 
6.10 The nonvanishing of L(1, y) for real 
nonprincipal y 
We denote by L(1, y) the sum of the series in (9). Thus, 
e n 
10,3 - y, 9. * 
n=1 Nt 
In the proof of Dirichlet's theorem we need to know that L(1, y) # 0 when x 
is a nonprincipal character. We prove this here for real nonprincipal char- 


acters. First we consider the divisor sum of x(n). 
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Theorem 6.19 Let y be any real-valued character mod k and let 
Aln) = ¥ x(d). 
djn 


Then A(n) > 0 for all n, and A(n) > 1 if n is a square. 


Proor. For prime powers we have 
a a 
Ap) = } xp) = 1+ È x7. 
t=0 t= 


Since y is real-valued the only possible values for X(p) are 0, 1 and —1. If 
X(p) = 0 then A(p°) = 1; if x(p) = 1 then A(p’) = a + 1; and if xp) = -1 
then 


r 0 ifais odd, 
gC) = f if a is even. 
In any case, A(p°) > 1 if a is even. 

Now if n= p,“ -+ p,” then A(n) = A(p,") --- A(p,*) since A is multi- 
plicative. Each factor A(p;") > 0 hence A(n) > 0. Also, if n is a square then 
each exponent a; is even, so each factor A(p;") > 1 hence A(n) 2 1. This 
proves the theorem. 


a 


Theorem 6.20 For any real-valued nonprincipal character y mod k, let 


An) =F xd) and Bo) = y A). 
din nsx n 
Then we have: 


(a) B(x) > oo as x > oo. 
(b) Bix) = 2/xL(1, y) + o(1) for all x > 1, 
Therefore L(1, y) + 0. 


Pnoor. To prove part (a) we use Theorem 6.19 to write 


Bee a yl 
nsx n msyx m 
n=m2 


The last sum tends to co as x > © since the harmonic Series Y^ 1/m diverges. 
To prove part (b) we write 


1 
BR) = y y yes x(a) 
Ea Um 
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Now we invoke Theorem 3.17 which states that 


Y fasad = Xf woz) + row) — F(a)G(b) 
ad nsa nsb n 

qdsx 
where ab = x, F(x) = Jn <x f(n), and G(x) = Y,<,g(n). We take a = b = 
Jx and let f(n) = xin, g(n) — 1//n to obtain 

-yL y n o() cessed Le «E — FLAG 
(12 B(x) = r Tad wes Ja > PE. x VIG). 

qdsx 
By Theorem 3.2 we have 
1 1 
6jo= 3 =z out 
(= Dadar at (2) 


where A is a constant, and by Theorem 6.18, Equation (11), we have 


= ph (2$ 
F(x) L5 B+ m. 


where B = Y*., x(n)/,/n. Since F(/x)G(/x) = 2Bx!^ + O(1), Equation 
(12) gives us 


ei AEn iB 
E o t E o( P) — 2Bx!* + Q(1) 
nS A/ n Pe 
x 


Jn 
x(n) x(n) 1 ) 
tay Wy (7. X, wt 


(n 
n 
z 
-2/x Y 

nsy/x n nsJx/n 


1 1 
BY —+0(— Y 1|-28x'^ +00 
T is (EU) d is 
= 2 /x L(l, x) + O(1). 


This proves part (b). Now it is clear that parts (a) and (b) together imply that 
L(, x) # 0. 


Exercises for Chapter 6 


1, Let G be a set of nth roots of a nonzero complex number. If G is a group under 
multiplication, prove that G is the group of nth roots of unity. 


2. Let G be a finite group of order n with identity element e. If Q,,...,a, are n 
elements of G, not necessarily distinct, prove that there are integers p and q with 
1 <p < q < n such that a,a,,,---a,=e. 
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b : ] 
3. Let G be the set of all 2 x 2 matrices k A where a, b, c, d are integers with 


ad — bc = 1. Prove that G is a group under matrix multiplication. This group is 
sometimes called the modular group. 


4. Let G = (a) be a cyclic group generated by a. Prove that every subgroup of G is 
cyclic. (It is not assumed that G is finite.) 


5. Let G be a finite group of order n and let G' be a subgroup of order m. Prove that 
m|n (Lagrange's theorem). Deduce that the order of every element of G divides n. 


6. Let G bea group of order 6 with identity element e. Prove that either G is cyclic, or 
else there are two elements a and b in G such that 


G = (a, a*, a?, b, ab, a?b), 
with a? = b? = e. Which of these elements is ba? 


7. A group table for a finite group G = (a,,...,a,) of order n is an n x n matrix 
whose ij-entry is a,a;. If a;a; = e prove that aja; = e. In other words, the identity 
element is symmetrically located in the group table. Deduce that if n is even the 
equation x? = e has an even number of solutions. 


8. Generalizing Exercise 7, let f(p) denote the number of solutions of the equation 
x? = e, where p is a prime divisor of n, the order of G. Prove that P| f (p) (Cauchy's 
theorem). [Hint: Consider the set S of ordered p-tuples (a,,..., ap) such that a; € G 
and a, ---a, = e. There are n^7! p-tuples in S. Call two such p-tuples equivalent 
if one is a cyclic permutation of the other. Show that f(p) equivalence classes contain 
exactly one member and that each of the others contains exactly p members. Count 
the number of members of S in two ways and deduce that pl f(p)- 

9. Let G be a finite group 


of order n. Prove that n is odd if, and only if, each element of. 
G is a square. That is, 


for each a in G there is an element b in G such that a = b?, 


10. State and prove a generalization of Exercise 9 in which the condition “n is odd" is 
replaced by “n is relatively prime to k” for some k > 2. 


11. Let G be a finite group of order n, and 
elements of G. Prove that for each gin 
ab — g. 


let S be a subset containing more than n/2 
G there exist elements a and b in $ such that 


12. Let G bea group and let $ bea Subset of n distinct elemen 


13. Let fi, ..., fa be the characters ofa finite group G of order m, and 
of G of order n. Theorem 6.7 s us da Pint 


hows that each number f(a) is an nth Toot of unity. 
Prove that every nth root of unity occurs e 


qually often amo; th 
fala), ..., fala). [Hint: Evaluate the sum "semi fila, 


X È Slate 2r 


r=Lk=1 
in two ways to determine the number of times exi» 


occurs, ] 
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14. Construct tables showing the values of all the Dirichlet characters mod k for 
k = 8,9, and 10. 


15. Let x be any nonprincipal character mod k. Prove that for all integers a < b we have 


b 
LY xn) 


1 
< 3 ek). 
16. If y is a real-valued character mod k then x(n) = +1 or 0 for each n, so the sum 
k 
S= Y nxin) 
n=1 


is an integer. This exercise shows that 128 = 0 (mod X). 


(a) If (a, k) = 1 prove that ax(a)$ = S (mod k). 

(b) Write k = 2*g where q is odd. Show that there is an integer a with (a, k) = 1 
such that a = 3 (mod 2?) and a = 2 (mod q). Then use (a) to deduce that 
128 = 0 (mod k). 


17. An arithmetical function f is called periodic mod k if k » 0 and f(m) = f(n) 
whenever m = n (mod k). The integer k is called a period of f. 


(a) If f is periodic mod k, prove that f has a smallest positive period kọ and that 
kolk. 

(b) Let f be periodic and completely multiplicative, and let k be the smallest 
positive period of f. Prove that f(n) = 0 if (n, k) > 1. This shows that f is a 
Dirichlet character mod k. 

18. (a) Let f bea Dirichlet character mod k. If k is squarefree, prove that k is the smallest 


positive period of f. 
(b) Give an example of a Dirichlet character mod k for which k is not the smallest 


positive period of f. 
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Dirichiet’s Theorem on 
Primes in Arithmetical Progressions 


7.1 Introduction 


(1) kn +h,n=0,1,2,... 


If h and k have a common factor d, each term of the progression is divisible 
by d and there can be no more than one prime in the Progression if d > 1, 
In other words, a necessary condition for the existen 


arithmetic progression (1) contains infinitely ma: 


on a proof published in 1950 by Harold N. Sha iro [65 i 
series )’ p^! log p rather than pE EPERE] soos with he 


First we show that for Certain special 


7.2: Dirichlet's theorem for primes of the form 4n — 1 and 4n + 1 


7.2 Dirichlet's theorem for primes of 
the form 4n — 1 and 4n + 1 


Theorem 7.1 There are infinitely many primes of the form 4n — 1. 


Pnoor. We argue by contradiction. Assume there are only a finite number of 
such primes, let p be the largest, and consider the integer 


N = 2?.3-5---p—1, 
The product 3 - 5--- p contains all the odd primes <p as factors. Since N 
is of the form 4n — 1 it cannot be prime because N > p. No prime «p 
divides N, so all the prime factors of N must exceed p. But all of the prime 
factors of N cannot be of the form 4n + 1 because the product of two such 


numbers is again of the same form. Hence some prime factor of N must be 
of the form 4n — 1. This is a contradiction. oO 


A different type of argument can be used for primes of the form 4n + 1. 


Theorem 7.2 There are infinitely many primes of the form 4n + 1. 


Proor. Let N be any integer > 1. We will sh 


ow that there is a prime p > N 
such that p = 1 (mod 4). Let 


m — (NY? +1. 


Note that m is odd, m > 1. Let p be the smallest prime factor of m. None of 
the numbers 2, 3,..., N divides m, so p > N. Also, we have 


(N!? = —1 (mod p). 
Raising both members to the (p — 1)/2 power we find 
(NDP! = (—1)9- 9? (mod p). 
But (N!)"^! = 1 (mod p) by the Euler-Fermat theorem, so 
(~~? = 1 (mod p), 
Now the difference (— 1y»- 1/2 _ | is either 0 or —2, and 
because it is divisible by p, so it must be 0. That is, 


(-1)?- 92 51 


it cannot be —2, 


But this means that (p — 1)/2 is even, so p=i 
we have shown that for each integer N > 1 there i 


p = 1 (mod 4). Therefore there are infinitely 
4n 4 1. 


(mod 4). In other words, 
S a prime p > N such that 
many primes of the form 


a 


primes of the form 4n — 1 
cial arithmetic Progressions, 


Simple arguments like those just given for 
and 4n + 1 can also be adapted to treat other spe 
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= ierpinski [67]), but no one 
5n — 1, 8n — 1, 8n — 3 and 8n + 3 (see Sierpins| : 
Horae found such a simple argument that works for the general progression 
kn + h. 


7.3 The plan of the proof of Dirichlet’s 
theorem 


In Theorem 4.10 we derived the asymptotic formula 


Q) y zu. log x 4- O(1), 


psx 


where the sum is extended over all primes p < x. We shall prove Dirichlet’s 
theorem as a consequence of the following related asymptotic formula. 


Theorem 7.3 If k > 0 and (h, k) = 1 we have, for all x > 1, 


log p 1 
(3) —— =—, log x + O(1), 
yx po wks 
pzh (modk) 
where the sum is extended over those primes p € x which are congruent to 
h mod k. 


Since log x > oo as x — oo this relation im; 
many primes p =h ( 
nk + hn =0,1,2,... 


Note that the principal term on the right of (3) is independent of h. There- 
fore (3) not only implies Dirichlet’s theorem but it also shows that the primes 
in each of the g(k) reduced residue classes mod k make the same contribution 
to the principal term in (2). 


plies that there are infinitely 
mod k), hence infinitely many in the progression 


The proof of Theorem 7.3 will be presented through a sequence of lemmas 
which we have collected together in this section to reveal the plan of the 
proof. Throughout the chapter we adopt the following notation, 

The positive integer k represents a fixed modulus, and h is a fixed integer 
relatively prime to k. The ø(k) Dirichlet characters mod k are denoted by 


Xi» Xii Xow 


with y, denotin 


: g the principal character. For y # X1 We write L(1, y) and 
L'(L, x) fo 


T the sums of the following series: 
Uy = y X9 
n=1 Nn 


Li, = - y Meer 
n=1 
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7.3: The plan of the proof of Dirichlet’s theorem 


The convergence of each of these series was shown in Theorem 6.18. More- 
over, in Theorem 6.20 we proved that L(1, y) ¥ 0 if y is real-valued. The 
symbol p denotes a prime, and } „<x denotes a sum extended over all primes 
psx. 


Lemma 7.4 For x > 1 we have 


logp 1 1 ee x(p)log p 
— =— logx + — Yzx(hy, = * + 011). 
ms p om” olk) pu p 0 
p=h (modk) 


It is clear that Lemma 7.4 will imply Theorem 7.3 if we show that 


(4) y meer = 0(1) 


psx 


for each y # xı. The next lemma expresses this sum in a form which is not 
extended over primes. 


Lemma 7.5 For x > 1 and x # y, we have 


BP pa, yy HO). ow, 
ee p nsx n 


Therefore Lemma 7.5 will imply (4) if we show that 


(5) X Men = O(1). 


nsx 


This, in turn, will be deduced from the following lemma. 


Lemma 7.6 For x > 1 and x # x, we have 


(6) 1, p y, AHO - ow, 


nsx 


If L(1, x) z 0 we can cancel L(1, y) in (6) to obtain (5). Therefore, the 
proof of Dirichlet’s theorem depends ultimately on the nonvanishing of 
L(A, x) for all x # x1. As already remarked, this was proved for real y # y, in 
Theorem 6.20, so it remains to prove that L(1, y) # 0 for all x # x, which 
take complex as well as real values. 

For this purpose we let N(k) denote the number of nonprincipal characters 
y mod k such that L(l, x) = 0. If L(1, x) = 0 then L(1, 7) = 0 and x x 
since x is not real. Therefore the characters y for which L(1, y) = 0 occur in 
conjugate pairs, so N(k) is even. Our goal is to prove that N(k) — 0, and this 
will be deduced from the following asymptotic formula. 
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Lemma 7.7 For x > 1 we have 


logp _ 1— N(k) 
LE EEEEPT US 


P=l1 (modk) 


log x + O(1). 


If N(k) # 0 then N(k) > 2 since N(k) is even, hence the coefficient of 
log x in (7) is negative and the right member > —© as x c. This is a 
contradiction since all the terms on the left are positive. Therefore Lemma 7.7 
implies that N(k) — 0. The proof of Lemma 7.7, in turn, will be based on the 
following asymptotic formula. 


Lemma 7.8 If y # y, and L(1, y) = 0 we have 


L'(L x) pane) = logx + O(1). 


nsx 


7.4 Proof of Lemma 7.4 


To prove Lemma 7.4 we be 


gin with the asymptotic formula mentioned 
earlier, 


lo; 1 
(2) à» SEE. log x + O(1) 
psx P 
and extract those terms in the sum arising from primes p = h (mod k). The 


extraction is done with the aid of the orthogonality relation for Dirichlet 
characters, as expressed in Theorem 6.16: 


ope MeO (mod k), 
uet) P t ifm #n (mod k). 


This is valid for (n; k) = 1. We take m = pandn = h, where (h, k) = 1, then 
multiply both members by p^! log p and sum over all p < x to obtain 


olk) M lo I 
8 YY with 2? = oy y er 
psxr=1 p psx 
P=h (mod k) 
In the sum on the left we isolate those terms involving only the Principal 
character y, and rewrite (8) in the form 


lo » lo Bn p) 

O e D -amy BOE? | Sr y. Mplogp: 

Sx p psx p r=2 psx D 
p=h (mod) 


Now ,(h) = 1 and Xi(p) = 0 unless (p, k) = 1, in whi 


ch case y,(p) = 1. 
Hence the first term on the right of (9) is given by 


logp log p log lo 
(1) — y Sy pre m SEP tty 
pex P psx P psx P psx P 
(P.K=1 pik 
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7.5: Proof of Lemma 7.5 


since there are only a finite number of primes which divide k. Combining 
(10) with (9) we obtain 


lo lo «o „(p)lo 
ok X TOR PIC » BEP. Ya xplore | Oy) 
P psx P r=2 psx P 


psx 
pzh (modk) 


Using (2) and dividing by o(k) we obtain Lemma 7.4. [s] 


7.5 Proof of Lemma 7.5 

We begin with the sum 
Y AD 
nsx 


where A(n) is Mangoldt’s function, and express this sum in two ways. First 
we note that the definition of A(n) gives us 


Y XA) _ 2» x(p^)log p 
n ce 


nsx psxa=1 P 
psx 
We separate the terms with a = 1 and write 
n)A(n) lo, = lo 
an y A9 zy d EP. y y osr 
nsx n psx p psx a=2 p 
psx 


The second sum on the right is majorized by 


el. logp = logn M 
Zoer) a= iD < Danie > 


n-2 


so (11) gives us 


(12) y Wer _ X An)A(n) o). 
psx p nsx in 


Now we recall that A(n) = Fun u(d)log(n/d), hence 
Al 
pO y Oy doe. 


nsx nsx M q 
In the last sum we write n = cd and use the multiplicative property of X 
to obtain 


(13) 


Xn)AQ) p(d)x(d) x(c)log c 
2 n 2 d du c : 


Since x/d > 1, in the sum over c we may use formula (10) of Theorem 6.18 to 
obtain 


x/d 


esx/d 


nee "AP Lp + o( 8 4. 
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Equation (13) now becomes 
x(n)A(Q) > u(d)x(d) 1 log x/d\ 
Cages a 0 ua 
The sum in the O-term is 
1 
1 ¥ (log x — log d) - — (bares x — J log a) = O(1) 
X dex x dsx 
since 
È log d = log[x]! = x log x + O(x). 
dsx 
Therefore (14) becomes 


xe)AG) _ 
y ene) 


nsx 


-Lq, 3 y, AHO , oq 
dsx 


which, with (12), proves Lemma 7.5. 


7.6 Proof of Lemma 7.6 


We use the generalized Móbius inversion formula proved in Theorem 2.23 
which states that if œ is completely multiplicative we have 


(15) G(x) = Y a(n)FI () if, and only if, F(x) = y: Hoaono() 


nsx 


We take o(n) = y(n) and F(x) = x to obtain 


(16) pes une) 


where 


GW) = Yxmi-xy X9) 
nsx n 
By Equation (9) of Theorem 6.18 we can write G(x) = xL(1, y) + O(1). 
Using this in (16) we find 2 


nsx 


c= Y we L(t, 2) + ow} = xia, p y, X9 , oc 
nsx 


Now we divide by x to obtain Lemma 7.6. 
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7.8: Proof of Lemma 7.7 


7.7 Proof of Lemma 7.8 


We prove Lemma 7.8 and then use it to prove Lemma 7.7. Once again we 
make use of the generalized Möbius inversion formula (15). This time we 
take F(x) = x log x to obtain 


(17) xlgx-)Y wontne( ) 
where 
G(x) = Y xr) = log ~ = xlogxY x(n) _ x», mem. 


Now we use formulas (9) and (10) of Theorem 6.18 to get 


G(x) = x log fra. y+ «(2 + «ro. T o( 28 3l 


= xL'(1, x) + O(log x) 


since we are assuming that L(1, y) = 0. Hence (17) gives us 
xlogx= Y pee L(t, ) + o( tos 3 


shay eres Henn + «(x ¥ dog x — log 21 


nsx 
We have already noted that the O-term on the right is O(x) (see the proof of 
Lemma 7.5). Hence we have 


Mens) 


x log x = xL (1, x) , ——— + O(), 


nsx 


and when we divide by x we obtain Lemma 7.8. O 


7.8 Proof of Lemma 7.7 
We use Lemma 7.4 with h = 1 to get 


= logx + 
2 p oe OS e 


p=1 (modk) 


In the sum over p on the right we use Lemma 7.5 which states that 


x(p)log p : u(n)x,(n) 
» ee Lü.x)Y, : 


psx 


(18) 


I 1 1 99 l 
y log Pp _ Sy ure + OQ). 


+ O(1). 
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If L(1, y,) # 0, Lemma 7.6 shows that the right member of (18) is O(1). But 
if L(1, x,) = 0 then Lemma 7.8 implies 
uln)x{n) 1 
-L'i — = —log x + 0(1). 
LDE 7 gx + Of 
Therefore the sum on the right of (18) is 


1 
20 (—N(k)log x + O(1)), 


so (18) becomes 
logp 1-—N(k) 
= ——75— log x + O(1): 
px Pp eu) ^8 0 
p=1 (modk) 
This proves Lemma 7.7 and therefore also Theorem qua m| 


As remarked earlier, Theorem 7.3 implies Dirichlet’s theorem: 


Theorem 7.9 If k > 0 and (h, k) = 1 there are infinitely many primes in the 
arithmetic progression nk + h, n = 0,1,2..5 


7.9 Distribution of primes in arithmetic 
progressions 


If k > 0 and (a, k) = 1, let 
-()- Y 1. 


PSx 
P=a (modk) 
The function z,(x) counts the number of 
nk + a,n = 0, 1,2, ... Dirichlet's theorem s 
There is also a prime number theorem fo; 
states that 

(x) 1 x 
(19) Wy ad X 
d) m n e()logx >% 

if (a, k) = 1. A proof of (19) is outlined in [44]. 


The prime number theorem for Progressions is suggested by the formula 
of Theorem 7.3, 


primes «x in the progression 
hows that TAX) — oo as x > oo. 
r arithmetic progressions which 


log p il 
EM Og F OL. 
hop mp Be + O0) 
pzh (modk) 


Since the principal term is independent of h, the primes seem to be equally 
distributed among the q(k) reduced residue classes mod k, and (19) is a 
precise statement of this fact. 
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Exercises for Chapter 7 


We conclude this chapter by giving an alternate formulation of the prime 
number theorem for arithmetic progressions. 


Theorem 7.10 If the relation 


X, 
(20) nx) ~ E 
holds for every integer a relatively prime to k, then 
(21) T(x) ~ T(x) asx- oo 
whenever (a, k) = (b, k) = 1. Conversely, (21) implies (20). 


Pnoor. It is clear that (20) implies (21). To prove the converse we assume (21) 
and let A(k) denote the number of primes that divide k. If x > k we have 


z(x) - Y1-24()4 X1 


psx psx 
přk 
k k 
=Ak)+ X X 12407 Y n9. 
a=1 psx a=1 
(a,k)=1 pza (modk) (a,k)=1 


Therefore 
n(x) — Ak) — & mn) 


m(x) a=1 m(x) 
(a,k)=1 


By (21) each term in the sum tends to 1 as x — oo so the sum tends to ¢(k). 
Hence 

c AD oft) as x — oo. 

mx) m(x) 


But A(k)/7;(x) + 0 so n(x)/n,(x) > o(k), which proves (20). o 


Exercises for Chapter 7 

In Exercises 1 through 4, h and k are given positive integers, (h, k) = 1, 
and A(h, k) is the arithmetic progression A(h, k) = {h + kx:x = 0,1,2,...). 
Exercises 1 through 4 are to be solved without using Dirichlet’s theorem. 


1. Prove that, for every integer n > 1, A(h, k) contains infinitely many numbers relatively 
prime to n. 


2. Prove that A(h, k) contains an infinite subset (a;, a5, ...) such that (a;, a)=1if 
ij. 
3. Prove that A(h, k) contains an infinite subset which forms a geometric progression 


(a set of numbers of the form ar", n = 0, 1, 2, ...). This implies that A(h, k) contains 
infinitely many numbers having the same prime factors. 
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H 


4. Let S beany infinite subset of A(h, k). Prove that for every positive integer n there isa 


number in A(h, k) which can be expressed as a product of more than n different 
elements of S. 


5. Dirichlet's theorem implies the following statement: If h and k > 0 are any two 


integers with (h, k) = 1, then there exists at least one prime number of the form 
kn + h. Prove that this statement also implies Dirichlet's theorem. 


6. If (h, k) = 1, k > 0, prove that there is a constant A (depending on h and on k) such 
that, if x 2 2, 


1 1 1 
— =—~ log log x + A + Of —— 
Zp gio Ao. 
pzh (modk) 
7. Construct an infinite set S of primes with the followin, : 
g property: If p e S and 
then (Hp — 1,48 — 1) = (p,q — 1) = (p — 1,9) = 1. ais lene 


8. i DUREE E polynomial of degree n > 1 with the following property: 
or each prime p there exists a prime q and an intege x 
Prove that q — p, m — n and fe) Hit aes 


= x" for all x. [Hint: If q # p th *! divi 
Sip + tg"*") — f(p) for each t = 1,2,...] eS Sm 


Periodic Arithmetical Functions 
and Gauss Sums 


8.1 Functions periodic modulo k 


Let k be a positive integer. An arithmetical function f is said to be periodic 
with period k (or periodic modulo k) if 


f(n + k) = fin) 


for all integers n. If k is a period so is mk for any integer m > 0. The smallest 
positive period of fis called the fundamental period. 

Periodic functions have already been encountered in the earlier chapters. 
For example, the Dirichlet characters mod k are periodic mod k. A simpler 
example is the greatest common divisor (n, k) regarded as a function of n. 
Periodicity enters through the relation 


(n + k, k) = (n, k). 
Another example is the exponential function 
f(n) = e2nimnik 
where m and k are fixed integers. The number e?" is a kth root of unity 
and f (n) is its nth power. Any finite linear combination of such functions, say 
»» c(m)e?7imni 
m 


is also periodic mod k for every choice of coefficients c(m). Our first goal is to 
show that every arithmetical function which is periodic mod k can be 
expressed as a linear combination of this type. These sums are called finite 
Fourier series. We begin the discussion with a simple but important example 
known as the geometric sum. 
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8: Periodic arithmetical functions and Gauss sums 


Theorem 8.1 For fixed k > 1 let 
k-1 E 
gin) = 7 girimnlk. 
m-0 


Then 


0 ifkyn 
de) — 1, if kin. 


Pnoor. Since g(n) is the sum of terms in a geometric progression, 


k-i 
gin) = Y x", 
m=0 


where x = e?*"/k we have 
x -—1 
—— j 1 
gn-lx-i !**L 
k fx = 1; 
But x* = 1, and x = 1 if and only if k|n, so the theorem is proved. D 


8.2 Existence of finite Fourier series for 
periodic arithmetical functions 


We shall use Lagrange's polynomial interpolation formula to show that every 
periodic arithmetical function has a finite Fourier expansion. 


Theorem 8.2 Lagrange's interpolation theorem. Let zo, Zi, 
distinct complex numbers, and let wo, Wi, 
which need not be distinct. Then there 
<k — 1 such that 


«es Zk-4 be k 
+++) Wa be k complex numbers 
is a unique polynomial P(z) of degree 
P(z,) = Wa form = 0, PX oe a 


Proor. The required polynomial P(z), called the Lagrange interpolation 
polynomial, can be constructed explicitly as follows. Let 


Ale) = (2 ~ zo) — 24)---(@ — 2.) 
and let 
As) = A@ 


Then A,(z) is a polynomial of degree k — 1 with the following properties: 
Anl@m) #0, Anz) =O ifjzm 
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8.2: Existence of finite Fourier series for periodic arithmetical functions 


Hence A,,(z)/A,,(Zm) is a polynomial of degree k — 1 which vanishes at each 
z; for j # m, and has the value 1 at z,,. Therefore the linear combination 


ee) 
e a a 


is a polynomial of degree <k — 1 with P(z,) = w; for each j. If there were 
another such polynomial, say Q(z), the difference P(z) — Q(z) would vanish at 
k distinct points, hence P(z) = Q(z) since both polynomials have degree 
<k-1. (m; 


Now we choose the numbers Zo, Z4, ..., Z- ; to be the kth roots of unity 
and we obtain: 


Theorem 8.3 Given k complex numbers wo, w,, .. . , w,..,, there exist k uniquely 
determined complex numbers ao, a, ... , ay.., such that 
k-1 , 
(1) Wm = X a„e?"imnik 
n=0 
for m = 0, 1, 2, ..., k — 1. Moreover, the coefficients a, are given by the 
formula 


k-1 
Q) a,=> Y w,e 7" * for n —0,1,2,..., k=. 
m=0 


PROOF. Let Zm = e?" The numbers zo, z;, ..., Zx- ı are distinct so there is 
a unique Lagrange polynomial 


PEYS Yer 


such that P(z,) = Wm for each m = 0, 1, 2, ..., k — 1. This shows that there 
are uniquely determined numbers a, satisfying (1). To deduce the formula (2) 
for a, we multiply both sides of (1) by e ?*""”/*, where m and r are nonnegative 
integers less than k, and sum on m to get 


k-1 k-1 k-1 , 
25 Wae PME c an »i grin r)mlk. 
m-0 n=0 m=0 


By Theorem 8.1, the sum on m is O unless k|(n — r). But |n — r| k—1so 
k|(n — r) if, and only if, n = r. Therefore the only nonvanishing term on the 
right occurs when n — r and we find 

k-1 

Y we 2rimrik — ka,. 


m=0 


This equation gives us (2). o 
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i i i hich is periodic mod k. Then 
8.4 Let f be an arithmetical function w } nod 
uere isa y determined arithmetical function g, also periodic mod k, 
such that 


k=1 E 
Som) = Y, gneri, 
n=0 
In fact, g is given by the formula 


k-1 
ate) = x Y, fine tren. 


PROOF. Let wm = f(m) for m = 0,1,2,..., k — 1 and apply Theorem 8.3 to 
determine the numbers ao, a, . . ., ay. ,. Define the function g by the relations 
g(m) = am for m = 0, 1,2, ..., k — 1 and extend the definition of g(m) to all 
integers m by periodicity mod k. Then fis related to g by the equations in the 
theorem. 


Note. Since both f and g are periodic mod k we can rewrite the sums in 
Theorem 8.4 as follows: 


) f(m)- Y, geri 
nmodk 
and 
[7 a(n) =~ fimen, 
mmodk 


In each case the summation can be extended 
system modulo k. The sum in (3) is called th 
the numbers g(n) defined by (4) are called t 


over any complete residue 
finite Fourier expansion of f and 
he Fourier coefficients of f 


8.3 Ramanujan’s sum and generalizations 


In Exercise 2.14(b) it is shown that the Möbius function yu(k) is the sum of the 
primitive kth roots of unity. In this section we generalize this result. Specifi- 
cally, let n be a fixed positive integer and consider the sum of the nth powers 


of the primitive kth roots of unity. This sum is known as Ramanujan’s sum 
and is denoted by c,(n): 
en) = 5i girimnlk. 
mmod k 
(m,k)=1 


We have already noted that this sum reduces to the Möbius function when 
me 


Hk) = c1). 
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8.3: Ramanujan’s sum and generalizations 


When k|n the sum reduces to the Euler q function since each term is 1 and 
the number of terms is q(k). Ramanujan showed that c,(n) is always an 
integer and that it has interesting multiplicative properties. He deduced these 
facts from the relation 
(5) a= D, (S. 
d|(n, k) 

This formula shows why c,(n) reduces to both p(k) and o(k). In fact, when 
n = 1 there is only one term in the sum and we obtain ¢,(1) = p(k). And when 
k|n we have (n, k) = k and c,(n) = Yn du(k/d) = q(k). We shall deduce (5) 
as a special case of a more general result (Theorem 8.5). 

Formula (5) for c,(n) suggests that we study general sums of the form 


© Y rad. 
d|(n, k) 


These resemble the sums for the Dirichlet convolution f + g except that we 
sum over a subset of the divisors of k, namely those d which also divide n. 
Denote the sum in (6) by s(n). Since n occurs only in the ged (n, k) we have 


Sin + k) = s(n) 


so s,(n) is a periodic function of n with period k. Hence this sum has a finite 
Fourier expansion. The next theorem tells us that its Fourier coefficients are 
given by a sum of the same type. 


Theorem 8.5 Let s,(n) = Xan, w f (d)g(k/d). Then s,(n) has the finite Fourier 
expansion 


(7) s(n) = X a,(m)e?*inn/* 
mmodk 
where 
k\d 
8 a,(m) = Ae. 
© sim) = >a i) i 
Proor. By Theorem 84 the coefficients a,(m) are given by 
1 " 
a(m) — r 2, s(n)e 2r 
k nmodk 
ah Y Y. f@g k — 2rinm|k 
kazi J d i 
d|k 


Now we write n — cd and note that for each fixed d the index c runs from 1 
to k/d and we obtain 


1 kN ue ; 
am = 7 5 rax.) Y, granite. 
c=1 
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Now we replace d by k/d in the sum on the right to get 


a,(m) = — » Xe saa) pane mea, 


But by Theorem 8.1 the sum on c is 0 unless d|m in which case the sum has 
the value d. Hence 


am) = — 2 UC Jti 


which proves (8). Oo 
Now we specialize f and g to obtain the formula for Ramanujan’s sum 
mentioned earlier. 
Theorem 8.6 We have 
k 
an= Y a 
dni) Md 
Proor. Taking f(k) = k and g(k) = u(k) in Theorem 8.5 we find 
» a) = » a,(m)e2*imnlk 
d|(n, k) d mmodk 
where 
am Z up- [I ]- fi eoa 
diim, Hs (m, k) 0 if(m k)» 1. 
Hence 
k 
du 5| = 2nimnjk _ 
E ri i) P mil = cn), o 


(m,k)=1 


8.4 Multiplicative properties of the 
sums s(n) 


Theorem 8.7 Let 


wo) = Y ru 
a(n, k) d 


where f and g are multiplicative. Then we have 


(9) Smu(ab) = s,,(a)s,(b) whenever (a, k) 
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— (b, m) — 1. 


8.4: Multiplicative properties of the sums s(n) 


In particular, we have 


(10) Sp(ab) = s,,(a) if (b, m) = 1, 
and 
(11) Sa) = Smla)g(k) if (a, k) = 1. 


Pnoor.: The relations (a, k) = (b, m) = 1 imply (see Exercise 1.24) 
(mk, ab) = (a, m) (k, b) 
with (a, m) and (b, k) relatively prime. Therefore 
m= Y sE) y sla) 
d|(mk, ab) d\(a,m)(b, k) 
Writing d = d,d; in the last sum we obtain 


sulab)= Y y faz) 


di|(a, m) d3|(b, k) 
m k 
= X fi a7) DET a$) = Sm(a)s,(b). 
d,|(a,m) 1/7 d2|(b,k) 2, 
This proves (9). 
Taking k = 1 in (9) we get 
Sm(ab) = s,(a)s;(b) = Sla) 
since s,(b) = f(1)g(1) = 1. This proves (10). Taking b = 1 in (9) we find 
Smi(@) = Sm(@)s,(1) = Smla)g(k) 
since s,(1) = f(1)g(k) = g(k). This proves (11). a 
EXAMPLE For Ramanujan’s sum we obtain the following multiplicative 
properties: 
Crab) = c(a)c,(b) whenever (a, k) = (b, m) = 1, 
Cm(ab) = c,,(a) whenever (b, m) = 1, 
and 
cQ(a) = Cm(@) E(k) whenever (a, k) = 1. 


Sometimes the sums s,(n) can be evaluated in terms of the Dirichlet 
convolution f * g. In this connection we have: 


Theorem 8.8 Let f be completely multiplicative, and let g(k) = u(k)h(k), where 
h is multiplicative. Assume that f(p) # 0 and f(p) # h(p) for all primes D, 
and let 


sie) = X faq). 
d|(n. k) 
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Then we have 


F(k)g(N) 


syn) = F(N) ° 


where F = f *g and N = k/(n, k). 


Pnoor. First we note that 
F(k) => f oF} E )- » (s udhia) = f(k) Ys d h(d) 
dj d (d) —— 


f(d) 
h(p) 
= fk 
= fen(r- 55). 


Next, we write a = (n, k), so that k = aN. Then we have 


n= k k — N N 
s(n) ys of i i) = i so (e) 
- p LC ) (Nd)k(Nd). 


Now p(Nd) = u(N)u(d) if (N, d) = 1, and (Nd) = 


equation gives us O if (N, d) > 1, so the last 


5,0) = H(N)h(N) L f (Guana = f()(N)(N) Y, pd) hla) 
waa 4 SA 


(N,d)=1 


E x 
= fW - Tae SUNN PEN. a TO) 


n 1 Ha) 
PIN 3A (p) 
= flaunynn) £0 f - j- FÜOMNYKN) — FON) 
Stl) FC F(N) FN) ` B 


ExAMPLE For Ramanujan's sum we obtain the following simplification: 


i k 

olin) 

en) = e()u(N)/g(N) = E Br 
Pn, kj 


8.5: Gauss sums associated with Dirichlet characters 


8.5 Gauss sums associated with Dirichlet 
characters 


Definition For any Dirichlet character y mod k the sum 
k 
G(n, x) = 9; xm? 
m=1 
is called the Gauss sum associated with y. 


If x = x, the principal character mod k, we have y,(m) = 1 if (m, k) = 1, 
and y;(m) = 0 otherwise. In this case the Gauss sum reduces to Ramanujan's 
sum: 


k 
G(uy)- Y erin (m) 
inet 


Thus, the Gauss sums G(n, y) can be regarded as generalizations of Ramanu- 
jan's sum. We turn now to a detailed study of their properties. 

The first result is a factorization property which plays an important role 
in the subsequent development. 


Theorem 8.9 If y is any Dirichlet character mod k then 
G(n, x) = X(n)G(1, x) whenever (n, k) = 1. 


Pnoor. When (n, k) = 1 the numbers nr run through a complete residue 
system mod k with r. Also, |x(n) ^ = x(n)z(n) = 1 so 


Xr) = xin) = ximxr). 


Therefore the sum defining G(n, y) can be written as follows: 


G(n, x) = 2. tren = x(n) Y, x(nr)e?=i"/k 
ub rmodk 


= 3e) 2 mec = x(n)G(1, x). 
This proves the theorem. n 
Definition The Gauss sum G(n, y) is said to be separable if 
(12) G(n, x) = x(m)G(L, x). 
Theorem 89 tells us that G(n, x) is separable whenever n is relatively 
prime to the modulus k. For those integers n not relatively prime to k we 


have the following theorem. 
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Theorem 8.10 If y is a character mod k the Gauss sum G(n, y) is separable for 
every n if, and only if, 
G(n, x) =0 whenever (n, k) > 1. 


PROOF. Separability always holds if (n, k) = 1. But if (n,k) > 1 we have 
7(n) = 0 so Equation (12) holds if and only if G(n, y) = 0. Oo 


The next theorem gives an important Consequence of separability. 
Theorem 8.11 If G(n, x) is separable for every n then 
(13) IG, x) = k. 


Proor. We have 


Py Y 
IG, YI? = GU, NETH = G(1, x) È me inm 


k k 
G(m, em ?rimik cL » X X(r)e?rimrik, - 2nim/k 
1 m=1r=1 


ll 


M> im 


` 
Qi 


k 
xr) 7 s e = kx(1) = k, 


T m- 


since the last sum over m is a Beometric sum which vanishes unless r = 1, 


8.6 Dirichlet characters with nonvanishing 
Gauss sums 


For every character x mod k we have seen that 
and that separability of G(n, 2) is equivalent to the vanishi: 
(n, k) > 1. Now we describe further properties of those cha: 
G(n, y) = 0 whenever (n, k) > 1. Actually, 
mentary set. The next theorem Bives a ne 
nonzero for (n, k) > 1. 


rtie racters such that 
It is simpler to study the comple- 
cessary condition for G(n, x) to be 


Theorem 8.12 Let y bea Dirichlet character mod k and assume that G(n, x) #0 


for some n Satisfying (n, k) > 1, Then there exists a divisor d of k, d < k, 
such that 


(14) x(a) — 1 whenever (a, k)=1andaz=1 (mod d), 
Proor. For the given n, let 4 = (n, k) and let d = k/q. Then 


q > 1, we have d < k. Choose any a satisfying (a, k) 
We will prove that x(a) = 1. . 


d|k and, since 
—-landaz| (mod 4). 
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Since (a, k) = 1, in the sum defining G(n, y) we can replace the index of 


summation m by am and we find 


G(n3)- Y x(m)""*— LY yxlam)e?™rarik 
mmodk ELM 
= x(a) »» y(m)e?rinemh. 
mmod k 


Since a = 1 (mod d) and d = k/q we can write a = 1 + (bk/q) for some 
integer b, and we have 


anm nm | bknm nm fe bnm E m (mod 1) 


k k qk k 


since q|n. Hence e?"inen/k = e?ninmk and the sum for G(n, y) becomes 
G(n, x) = x(a) Y, x(me^"""" = x(a)G(n, x). 
mmod k 
Since G(n, x) # 0 this implies x(a) = 1, as asserted. Oo 


The foregoing theorem leads us to consider those characters y mod k 
for which there is a divisor d < k satisfying (14). These are treated next. 


8.7 Induced moduli and primitive characters 


- Definition of induced modulus Let y be a Dirichlet character mod k and let d 


be any positive divisor of k. The number d is called an induced modulus 
for x if we have 


(15) x(a) 1 whenever (a, k) = 1 and a = 1 (mod d). 


In other words, d is an induced modulus if the character y mod k acts like 
a character mod d on the representatives of the residue class 1 mod d which 
are relatively prime to k. Note that k itself is always an induced modulus 


for x. 


Theorem 8.13 Let y be a Dirichlet character mod k. Then 1 is an induced 
modulus for x if, and only if, x = X1 


Pnoor. If y = y, then x(a)'— 1 for all a relatively prime to k. But since every a 
satisfies a = 1 (mod 1) the number 1 is an induced modulus. 

Conversely, if 1 is an induced modulus, then x(a) = 1 whenever (a, k) = 1, 
so x = x; since y vanishes on the numbers not prime to k. oO 
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For any Dirichlet character mod k the modulus k itself is an induced 
modulus. If there are no others we call the character primitive. That is, 
we have: 


Definition of primitive characters A Dirichlet character y mod k is said to be 
Primitive mod k if it has no induced modulus d < k. In other words, 
x is primitive mod k if, and only if, for every divisor d of k, 0 < d < k, 
there exists an integer a = 1 (mod d), (a, k) = 1, such that xla) # 1. 


If k > 1 the principal character % is not primitive Since it has 1 as an 
induced modulus. Next we show that if the modulus is prime every non- 
principal character is primitive. 


Theorem 8.14 Every nonprincipal character x modulo a prime p is a primitive 
character mod p. 


Proor. The only divisors of P are 1 and p so these are the only candidates 
for induced moduli. But if X # x, the divisor 1 is not an induced modulus 
so x has no induced modulus < p. Hence y is primitive. Im] 


Now we can restate the results of Theorems 8.10 through 8.12 in the 
terminology of primitive characters, 


Theorem 8.15 Let x be a primitive Dirichlet character mod k. Then we have: 


(a) G(n, y) = O for every n with (n, k) > 1. 
(b) G(n, y) is separable for ever y n. 
(c) |G(1, 2) =k. 


PROOF. If G(n, y) 4 0 for Some n with (n, k) > 1 th 
that y has an induced modulus d aie rr Ni 


8.8 Further properties of induced moduli 


The next theorem refers to thea 


€ ction of y on numbers Which are congruent 
modulo an induced modulus, 


Theorem 8.16 Let y be a Dirichlet character mod k and ass 


ume d|k. . 
Then d is an induced modulus Sor x if, and only if, Puch 


(16) x(a) = x(b) whenever (a, k) = (b, k)=landa=b (mod 4). 
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Proor. If (16) holds then d is an induced modulus since we may choose b =.1 
and refer to Equation (15). Now we prove the converse. 

Choose a and b so that (a, k) = (b, k) = 1 and a = b (mod d). We will 
show that x(a) = x(b). Let a’ be the reciprocal of a mod k, aa’ = 1 (mod k). 
The reciprocal exists because (a, k) = 1. Now aa’ = 1 (mod d) since d|k. 
Hence x(aa') = 1 since d is an induced modulus. But aa’ = ba’ = 1 (mod d) 
because a = b (mod d), hence x(aa') = x(ba'), so 


xla)xla') = x(b)x(a'). 


But x(a’) # 0 since x(a)x(a) = 1. Canceling x(a’) we find x(a) = x(b), and 
this completes the proof. Ba 


Equation (16) tells us that y is periodic mod d on those integers relatively 
prime to k. Thus y acts very much like a character mod d. To further explore 
this relation it is worthwhile to consider a few examples. 


EXAMPLE 1 The following table describes one of the characters y mod 9. 


We note that this table is periodic modulo 3 so 3 is an induced modulus for x. 
In fact, y acts like the following character y modulo 3: 


n 1 2 3 

y(n) 1 -1 0 
Since y(n) = Y(n) for all n we call y an extension of y. It is clear that whenever x 
is an extension of a character Y% modulo d then d will be an induced modulus 


for x. 


EXAMPLE 2 Now we examine one of the characters y modulo 6: 


In this case the number 3 is an induced modulus because x(n) — 1 for all 
n = 1 (mod 3) with (n, 6) = 1. (There is only one such n, namely, n = 1) 


169 


8: Periodic arithmetical functions and Gauss sums 


However, x is not an extension of any character modulo 3, because the 
only characters modulo 3 are the principal character w,, given by the table: 


n 1 2 as 
a et 
Vin) 1 1 0 


and the character y shown in Example 1. Since x(2) = 0 it cannot be an 
extension of either y or y. 


These examples shed some light on the next theorem. 


Theorem 8.17 Let y be a Dirichlet character modulo k and assume d|k, d > 0. 
Then the following two statements are equivalent : 
(a) d is an induced modulus for x. 
(b) There is a character y modulo d such that 


(17) x(n) = V(n)s(n) | for all n, 
where y, is the principal character modulo k. 


Proor. Assume (b) holds. Choose n satisfying (n, k) = l,nz1 (modd). 
Then x(n) = Y(n) = 1 so x(n) = 1 and hence d is an induced modulus, Thus, 
(b) implies (a). 

Now assume (a) holds. We will exhibit a character V modulo d for which 
(17) holds. We define Y(n) as follows: If (n, d) > 1, let V(n) = 0. In this case 
we also have (n, k) > 1 so (17) holds because both members are zero. 

Now suppose (n, d) = 1. Then there exists an integer m such that m = 
n (mod d), (m, k) = 1. This can be proved immediately with Dirichlet's 
theorem. The arithmetic Progression xd + ins infini i 


Wn) = x(m). 
The number y(n) is well-defined because 
which are congruent modulo d and relatively prime to k. 


The reader can easily verify that y is, indeed, a charact d 

verify that Equation (17) holds for all n. i SS anoe alg chal 
If (n,k)= 1 then (n, d) 

Hence, by Theorem 8.16, 


x takes equal values at numbers 


= 1 so Y(n) = y(m) for some m =n (mod d). 


x(n) = x(m) = Wn) = V(n)y, (n) 
since y,(n) — 1. 
If (n, k) > 1, then y(n) = y,(n) = 


0 and both members of (17 
(17) holds for all n. (17) are 0. Thus, 


LI 
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8.9 The conductor of a character 


Definition Let y be a Dirichlet character mod k. The smallest induced modulus 
d for y is called the conductor of y. 


Theorem 8.18 Every Dirichlet character y mod k can be expressed as a product, 


(18) x(n) = Vn), (n) for all n, 


where y, is the principal character mod k and V is a primitive character 
modulo the conductor of y. 


Proor. Let d be the conductor of y. From Theorem 8.17 we know that y can 
be expressed as a product of the form (18), where y is a character mod d. 
Now we shall prove that y is primitive mod d. 

We assume that y is not primitive mod d and arrive at a contradiction. 
If y is not primitive mod d there is a divisor q of d, q < d, which is an induced 
modulus for y. We shall prove that this q, which divides k, is also an induced 
modulus for y, contradicting the fact that d is the smallest induced modulus 
for x. 

Choose n = 1 (mod 4), (n, k) = 1. Then 


x(n) = Vn) (n) = Yn) = 1 


because q is an induced modulus for y. Hence q is also an induced modulus 
for y and this is a contradiction. o 


8.10 Primitive characters and separable 
Gauss sums 


As an application of the foregoing theorems we give the following alternate 
description of primitive characters. 


Theorem 8.19 Let y be a character mod k. Then x is primitive mod k if, and 
only if, the Gauss sum 


G(n = , xm?" 


mmod k 
is separable for every n. 


Proor. If y is primitive, then G(n, y) is separable by Theorem 8.15(b). Now 
we prove the converse. 

Because of Theorems 8.9 and 8.10 it suffices to prove that if y is not 
primitive mod k then for some r satisfying (r, k) > 1 we have G(r, y) # 0. 
Suppose, then, that y is not primitive mod k. This implies k > 1. Then y has a 
conductor d < k. Let r = k/d. Then (r,k) > 1 and we shall prove that 
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G(r, x) # 0 for this r. By Theorem 8.18 there exists a primitive character 
y mod d such that y(n) = v(n)y,(n) for all n. Hence we can write 


G(r, 2 2 2, Men, mje?ntm^ = wae, Member 


(m, k)=1 
= anima _ Q(k) 2nim|d 
co ee cut cad 
(m,k)-1 (m,d)=1 > 


where in the last step we used Theorem 5.33(a). Therefore we have 


_ 9t) 
Gtr, 2) = FUL, y). 


But |G(1, )|? = d by Theorem 8.15 (since y is 


primitive mod d) and hence 
G(r, x) # 0. This completes the proof. 


o 


8.11 The finite Fourier series of the Dirichlet 
characters 


Since each Dirichlet character x mod k is periodic mod k it has a finite 
Fourier expansion 


k 
(19) x(m) = E a,(n)e? imn. 


=1 


and Theorem 8.4 tells us that its Coefficients are given by the formula 


ims i 
an) =F ) y(m)e™ rim. 
m=1 
The sum on the right is a Gauss sum G(—n, y) so we have 


(20) aln) = z G(—n, y). 


Wher is primitive the F. ourier expansion (19) can be expressed as follows: 


Theorem 8.20 The finite F ourier expansion of a primitive Dirichlet character 
x mod k has the form 


(21) =i) ee 
i x(m) Vik p Une lk 

where 

(22) a= Hb 1 x 


St ee 


The numbers (y) have absolute value 1. 
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Proor. Since y is primitive we have G(—n, y) = X —n)G(1, y) and (20) 
implies a,(n) = X( —n)G(1, x)/k. Therefore (19) can be written as 


6G x) s. - ; Gp E ; 
y(m) = (1, x) Y gcnere = a Y: nye rimak, 


k 


which is the same as (21). Theorem 8.11 shows that the numbers z,(y) have 
absolute value 1. o 


n=1 


8.12 Pólya's inequality for the partial sums 
of primitive characters 


The proof of Dirichlet's theorem given in Chapter 7 made use of the relation 


È x(n) 
máx 


< olk) 


which holds for any Dirichlet character y mod k and every real x > 1. This 
cannot be improved when y = y, because Y*., y,(m) = q(k). However, 
Pólya showed that the inequality can be considerably improved when y is a 
primitive character. 


Theorem 8.21 Pólya's inequality. If y is any primitive character mod k then 
for all x > 1 we have 


« Jk log k. 


Proor. We express y(m) by its finite Fourier expansion, as given in Theorem 
8.20 


È xm) 


msx 


(23) 


k 
xm) = Ni $ amen iim, 


n=1 


and sum over all m < x to get 


-1w —2nimn/k 
PELA y PE Ye 


msx 


since y(k) = 0. Taking absolute values and multiplying by Jk we find 


(24) Vk 


say, where 


E 


k 
Yaxm|s Y 


msx n=1 


X g^ 2nimnjk 
msx 


k-1 
= LIfe), 


f(n) = x e 2rimnik 


msx 
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Now 
fk—n= 3x g^ 2rim(k-nyk _ X e2timn/k _ Tiny 
msx msx 
so 
If(k — n)| = | f(n)|. Hence (24) can. be written as 
Q5) y 23m|x2 Y | fn. 
msx nsk/2 


Now f(n) is a geometric sum of the form 


fe)- yy 


where r = [x] and y = e-2zisk- Here y z 1 since 1 < n X k — 1. Writing 
z — e^" we have y = 2? and 2? * lsincen < k/2. Hence we have 


t Edge aere "zz 
et ae z—g1 
so 
. mm 
ru e ririk _ grirnjk [sin ES 1 
Q6) [f(n)| = |= r= 


= < ^ 
sin ™” . nn 

7 sin — 
k k 


quality sin t > 2t/n, valid for 0 < t < 7/2, with t = 


Now we use the ine 


nn/k to get 
1 k 
n|z-—-—. 
If()| 2m^x 
Tk 
Hence (25) becomes 
1 
y xm | <k by — < k log k, 
máx nskj2 M 
and this proves (23). oO 
Fv In a later chapter we will prove that Polya’s inequality can be 
extended to any nonprincipal character, For imiti i 
beer iik, nonprimitive characters it 


PEL = O/k log k). 
(See Theorem 13.15.) 
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Exercises for Chapter 8 


1. Let x = e?*/" and prove that 


2. Let ((x)) = x — [x] — 1 if x is not an integer, and let ((x)) = 0 otherwise. Note that 
((x)) is a periodic function of x with period 1. If k and n are integers, with n » 0, 


prove that 
(&) 1 bi ot ™ sin 2nkm 
-]]= - = ¥ cot— $ 
n 2n zi n n 


3. Let c,(m) denote Ramanujan's sum and let M(x) = Y, sx nn), the partial sums of the 
Möbius function. 


(a) Prove that 
Yedm) = > ar). 
k=1 djm d, 
In particular, when n = m, we have 
YXa(m)- Y am(), 
k=1 dim 
(b) Use (a) to deduce that 


d 
Min) = m y; Mel Y ad) 


dm d Ksi 


Soim= rop; 


. Let n, a, d be given integers with (a, d) = 1. Let m = a + gd where 4 is the product 
(possibly empty) of all primes which divide'n but not a. Prove that 


(c) Prove that 


m =a (mod d) and (m, n) — 1. 
. Prove that there exists no real primitive character x mod k ifk = 2m, where mis odd. 


. Let y bea character mod k. If k; and k; are induced moduli for y prove that so too is 
(ky, k2), their gcd. 


- Prove that the conductor of y divides every induced modulus for x. 


In Exercises 8 through 12, assume that k — kıkz +-+- k,, where the positive 
integers k; are relatively prime in pairs: (k; k) = lif i s j. 


8. (a) Given any integer a, prove that there is an integer a; such that 


a; = a (mod kj) and a; = 1 (mod kj) forallj z i. 
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(b) Let x bea character mod k. Define y; by the equation 
xda) = x(aj, 
where a; is the integer of part (a). Prove that y; is a character mod k;. 


9. Prove that every character y mod k can be factored uniquely as a product of the 
form x = X1 X2 ``“ X», Where y; is a character mod k;. 


10. Let f(x) denote the conductor of x. If y has the factorization in Exercise 9, prove 


that f0) = f&a) f 02). 
11. If y has the factorization in Exercise 9, prove that for every integer a we have 
AN Ai 
G(a, x) = Iit )e«. x). 
iz1 i, 
where a; is the integer of Exercise 8. 


12. If y has the factorization in Exercise 9, prove that x is primitive mod k if, and only if, 
each x; is primitive mod k,. [Hint: Theorem 8.19.] 


13. Let x bea primitive character mod k. Prove that if N < M we have 
M 

xm) 
x — 


m=N+1 m 


2 
E ek 
*y VE loe 


14. This exercise outlines a slight improvement in Pól 
of Theorem 8.21. After inequality (26) write 


ya's inequality. Refer to the proof 


X es g — S d 
n 1 


msk/2 nskj2 |. 


Show that the integral is less than —(k/n)log(sin(z/2k)) and deduce that 


È x(n) < Ji += Jk logk. 


nsx 
This improves Pólya’s inequality by a factor 2/r in the principal term. 
15. The Kloosterman sum K(m, n; k) is defined as follows: 


K(m, n; k) = X ginimh nh' yk 
hmodk 
(hk)=1 
where h' is the reciprocal of h mod k. When k|n this reduces to Ramanujan's sum 
c,(m). Derive the following Properties of Kloosterman sums: 
(a) K(m, n; k) = Kín, m; k). 


(b) K(m, n; k) = K(1, mn; k) whenever (m, k) = 1. 
(c) Given integers n, k 1» k such that (ki, k;) = 1, show that there exist integers ny 
and n; such that 


n = nk; + njk,? (mod k,k3), 
and that for these integers we have 


Kim, n; kika) = K(m, n3; ky)K(m, nz; kz). 
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This reduces the study of Kloosterman sums to the special case K(m, n; p^), where 
p is prime. 
If n and k are integers, n > 0, the sum 
G(k;n)- » girikrin 
r=1 
is called a quadratic Gauss sum. Derive the following properties of quadratic Gauss 


sums: 
(a) G(k; mn) = G(km; n)G(kn; m) whenever (m, n) = 1. This reduces the study of 


Gauss sums to the special case G(k; p*), where p is prime. 
(b) Let p be an odd prime, p¥k, « > 2. Prove that G(k; p") = pG(k; p^?) and 


deduce that 
p if æ is even, 
G(k; p) = i 
(k; p°) d. V2G(k; p) if wis odd. 


Further properties of the Gauss sum G(k; p) are developed in the next chapter 
where it is shown that G(k; p) is the same as the Gauss sum G(k, y) associated with a 
certain Dirichlet character y mod p. (See Exercise 9.9.) 
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Quadratic Residues and the 
Quadratic Reciprocity Law 


9.1 Quadratic residues 
As shown in Chapter 5, the problem of solving a polynomial congruence 
f(x) = 0 (mod m) 


can be reduced to polynomial c 
linear congruences. This cha 
of the form 


ongruences with prime moduli plus a set of 
pter is concerned with quadratic congruences 


(1) x? =n (mod p) 


` where p is an odd prime and n # 0 (mod p). Since the modulus is prime we 
know that (1) has at most two solutions. Moreover, if x is a solution so is 
—x, hence the number of solutions is either 0 or 2. 


Definition If congruence (1) has a solutio; 
residue mod p and we write nRp. If (1) 
a quadratic nonresidue mod p and we w; 


n we say that n is a quadratic 
has no solution we say that n is 
rite nRp. 


Two basic problems dominate the theory of quadratic residues: 


= 


. Given a prime p, determine which n are quadratic residues mod p and 
which are quadratic nonresidues mod p. 
2. Given n, determine those primes p for which n is a 


1 s quadratic residue mod p 
and those for which n is a quadratic nonresidue 


mod p. 
We begin with some methods for solving problem 1. 
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EXAMPLE To find the quadratic residues modulo 1i we square the numbers 
1, 2, ..., 10 and reduce mod 11. We obtain 


P=, 2=4 3229 427 = 5, 5? = 3 (mod 11). 


It suffices to square only the first half of the numbers since 
6 z(-5p23, P=(-4? =5,...,10? =(—-1)? = 1 (mod 11). 


Consequently, the quadratic residues mod 11 are 1, 3, 4, 5, 9, and the non- 
residues are 2, 6, 7, 8, 10. 


This example illustrates the following theorem. 


Theorem 9.1 Let p be an odd prime. Then every reduced residue system mod p 
contains exactly (p — 1)/2 quadratic residues and exactly (p — 1)/2 
quadratic nonresidues mod p. The quadratic residues belong to the residue 
classes containing the numbers 


Q) 15:22:98 4. ded ad 
2,3... (5 


Pnoor. First we note that the numbers in (2) are distinct mod p. In fact, if 
x? = y? (mod p) with 1 < x < (p — 1)/2 and 1 < y < (p — 1)/2, then 


(x — y)(x + y) = 0 (mod p). 
But 1 <x + y < p so x — y = 0 (mod p), hence x = y. Since 
(p — k}? = k? (mod p), 
every quadratic residue is congruent mod p to exactly one of the numbers ~. 
in (2). This completes the proof. . o 


The following brief table of quadratic residues R and nonresidues R was 
obtained with the help of Theorem 9.1. 


p=3 p=5 p=7 p=11 p=13 
R: 1 1,4 1,2,4 LAASI 1, 3, 4, 9, 10, 12 
R: 2 23 3, 5,6 2, 6, 7, 8, 10 2, 5, 6, 7, 8, 11 


9.2 Legendre’s symbol and its properties 


Definition Let p be an odd prime. If n # 0 (mod p) we define Legendre’s 
symbol (n|p) as follows: 


_ f+ dfnRp, 
ep = {* if nRp. 


If n = 0 (mod p) we define (n|p) = 0. 
| 
| 
| 
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ExAMPLES (1|p) = 1, (m° |p) = 1, (7111) = —1, Q2|11) = 0. 
Note. Some authors write 6) instead of (n|p). 


It is clear that (m|p) = (n|p) whenever m = n (mod p), so (n|p) is a 
periodic function of n with period p. 
The little Fermat theorem tells us that n"^! = 1 (mod p) if p ¥ n. Since 


n?! —j{= (no 12 M 1)(n0- 1/2 + 1) 
it follows that n?" !/? = +1 (mod p). The next theorem tells us that we get 
+1 ifnRp and —1 if nRp. 


Theorem 9.2 Euler's criterion. Let p be an odd prime. Then for all n we have 
(n|p) = n'?-!"? (mod p). 


Proor. If 1 = 0 (mod p) the result is trivial since both members are con- 
gruent to 0 mod p. Now suppose that (n|p) — 1. Then there is an x such 
that x? = n (mod p) and hence 


n?- 12 = (x2)9- 02 = xe-12 1 = (n|p) (mod D). 


This proves the theorem if (n|p) — 1. 
Now suppose that (n|p) = — 1 and consider the polynomial 
f(x) = xen. 


Since f (x) has degree (p — 1)/2 the congruence 


f(x) = 0 (mod p) 


has at most (p — 1)/2 solutions. But the (p 


nc — 1)/2 quadratic residues mod p 
are solutions so the nonresidues are not. H 


ence 
ne- £1 (modp) if(n|p) = —1, 


But n^"? = +1 (mod p) so pe 92 = 4. (n|p) (mod p). This com- 

pletes the proof. o 

Theorem 9.3 Legendre’s symbol 
of n. 


Proor. If p|m or p|n then p|mn so (mn|p) = 0 and either (m|p) = 0 or 
(n|p) = 0. Therefore (mn|p) = (m|p)(n|p) if p|m or pin. 
If p ¥ m and p ¥ n then p f mn and we have 


(mn|p) = (mn)? 1? = mP- 1y2,0- 12 


(n|p) is a completely multiplicative function 


= (m|p)(n|p) (mod p). 
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But each of (mn|p), (m|p) and (n|p) is 1 or — 1 so the difference 
(mn|p) — (m|p)(nlp) 
is either 0, 2, or — 2. Since this difference is divisible by p it must be 0. 
Note. Since (n|p) is a completely multiplicative function of n which is 
periodic with period p and vanishes when p|n, it follows that (n|p) = x(n), 


where y is one of the Dirichlet characters modulo p. The Legendre symbol is 
called the quadratic character mod p. 


9.3 Evaluation of (—1|p) and (2|p) 
Theorem 9.4 For every odd prime p we have 

» jy") d if p = 1 (mod 4), 
a cr cd = if p =3 (mod 4). 


Pnoor. By Euler's criterion we have (—1|p) = (—1)®~'” (mod p). Since 
each member of this congruence is 1 or — 1 the two members are equal. O 


Theorem 9.5 For every odd prime p we have 


i_n- )] 1 if p= +1 (mod 8), 
a a n h -Li if p = +3 (mod 8). 


Proor. Consider the following (p — 1)/2 congruences: 


p-1zs 1(—1)! (mod p) 
2=2-1) (mod p) 
p—3t39(—1) (mod p) 


4=4(-1)* (mod p) 


r= EM (—1)*73/? (mod p), 


where r is either p — (p — 1)/2or (p — 1)/2. Multiply these together and note 
that each integer on the left is even. We obtain 


pot - 
2-4-6---(p—1)= (Eo? *0-1/2 (mod p), 
This gives us 
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Since ((p — 1)/2)! # 0 (mod p) this implies 
20-0? = (—1)9*-9/ (mod p). 


By Euler’s criterion we have 2°-12 = (2|p) (mod p), and since each 
member is 1 or — 1 the two members are equal. This completes the proof. 


Ey 


9.4 Gauss’ lemma 


Although Euler’s criterion gives a straightforward method for computing 
(n|p), the calculation may become prohibitive for large n since it requires 


raising n to the power (p — 1)/2. Gauss found another criterion which 
involves a simpler calculation. 


Theorem 9.6 Gauss’ lemma. Assume n X 0 (mod p) and consider the least 
positive residues mod p of the following (p — 1)/2 multiples of n: 


-1 
(3) n, 2n, 3n, . ..,? 7” 
If m denotes the number of these residues which exceed p/2, then 
(n|p) = (—1)". 
Proor. The numbers in (3) are incongruent mod p. We consider their least 
positive residues and distribute them into two disjoint sets A and B, according 


as the residues are <p/2 or > p/2. Thus 


A= (21,25, ...,a) 


where each a; = tn (mod p) for some t < (p — 1)/2 and 0 < a; < p/2; and 
B= (b, b;,... bp} 

where each b; = sn (mod p) for some s < (p — 1)/2 and p/2 < b, < p. Note 

thatm +k = (p 


t 7 : — 1)/2 since A and Bare disjoint. The number m of elements 
in B is pertinent in this theorem. Form a n 


J ew set C of m elements by sub- 
tracting each b; from p. Thus 


Ceu, Mcr). where €; — p — by. 

Now0 < c, « p/2so the elemen 

of A. We show next that the sets 

Assume that c; = a; for som 
=0 (mod p). Therefore 


ts of C lie in the same interval as the elements 
Aand C are disjoint. 


e pair i and j. Then p — b; = aj, or a; + b; 


in t sn — (t -snz0 (mod p) 
for some s and t with 1 < t < 
přn and 0< s+ t « p. The 
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p/2,1 < s < p/2. But this is impossible since 
refore A and C are disjoint, so their union 


9.4: Gauss’ lemma 


AUC contains m + k = (p — 1)2 integers in the interval CL, (p — 1)2]. 
Hence J 


—1 
A U € — (21725; .. 5 dg, C1y65; -- 5 Com ba. 
Now form the product of all the elements in A U C to obtain 
-1 
A103 +*+ 40,05 7 Cm = Eg !. 


Since c; — p — b, this gives us 


(5 "Yt asap — BA — 83-82 


-(-lFaia;.- °@,byb2 +++ by (mod p) 


= (—1)"n(2n)(3n) --- ( > ! n) (mod p) 


= (—1)"n?- (et) (mod p). 
Canceling the factorial we obtain 
n7? = (—1)" (mod p). 


Euler's criterion shows that (— 1)" = (n|p) (mod p) hence (— 1)" = (n|p) and 
the proof of Gauss' lemma is complete. 


To use Gauss' lemma in practice we need not know the exact value of m, 
but only its parity, that is, whether m is odd or even. The next theorem gives 
a relatively simple way to determine the parity of m. 


Theorem 9.7 Let m be the number defined in Gauss’ lemma. Then 


(p=1)/2 TE 
ma pg n ee 
rn | P 8 


In particular, if n is odd we have 


m= Pi (mod 2). 


t=1 


Pnoor. Recall that m is the number of least positive residues of the numbers 


p—1 
» 2n, 3n, ..., 
n, Zn, 3n 2 n 
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which exceed p/2. Take a typical number, say tn, divide it by p and examine 
the size of the remainder. We have 


m E EE a. where 0 < i x if, 
P Pp p P 
P| p p p P| p t 


say, where 0 <r, « p. The number r, = tn — p[tn/p] is the least positive 


residue of tn modulo p. Referring again to the sets 4 and B used in the proof 
of Gauss' lemma we have 


so 


rs 72,---5Mp-12} = (as 025... Gp D3, . .. , b). 
Recall also that 


-1 
ba EI (21,45... Cj, «209: 04] 


where each ci = p — bj. Now we compute the sums of the elements in these 
sets to obtain the two equations 


(p-1y2 k m 


t=1 i=1 j=1 
and 


@-1/2 k m k m 
È t= Yat c= Za +mp- Y by. 
SI i=1 ji i=1 ji 


In the first equation we replace r, by its definition to obtain 


t m (p-1)2 (p- 12 
Lat Ydj=n yt Sp Y B 


i=1 Pp 


t=1 1-1 
The second equation is 

E n (p-1)2 

mp + La- Yb;- SENI 

iz1 j=1 


t=1 


Adding this to the previous equation we get 
k (p-1y2 (p-1y2 
mpt2)a (1) y top y m 
i=1 1=1 t=1 |P 


- p-1 Cg 
=at yy Hi 


t=1 P 
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Now we reduce this modulo 2, noting that n + 1 =n — 1 (mod 2) and 
p = 1 (mod 2), and we obtain 
esq (—-1)2| 
m z (n — pP +È [=] (mod 2), 


which completes the proof. D 


t= 


9.5 The quadratic reciprocity law 


Both Euler’s criterion and Gauss’ lemma give straightforward though 
sometimes lengthy procedures for solving the first basic problem of the 
theory of quadratic residues. The second problem is much more difficult. 
Its solution depends on a remarkable theorem known as the quadratic 
reciprocity law, first stated in a complicated form by Euler in the period 
1744-1746, and rediscovered in 1785 by Legendre who gave a partial proof. 
Gauss discovered the reciprocity law independently at the age of eighteen 
and a year later in 1796 gave the first complete proof. 

The quadratic reciprocity law states that if p and q are distinct odd primes, 
then (p|q) = (q|p) unless p = q = 3 (mod 4), in which case pla) = — (qlp). 
The theorem is usually stated in the following symmetric form given by 
Legendre. 


Theorem 9.8 Quadratic reciprocity law. If p and q are distinct odd primes, then 
(4) (pla)(alp) = (—1yv- ?«- 1^, 


Proor. By Gauss’ lemma and Theorem 9.7 we have 


(alp) = (—1)" 
where 

(P-1)/2 

m= Y [2] (mod 2). 
^ r=1 |P 

Similarly, 

(pla) = (—1)" 
where 


i 
M 


PIE: ] (mod 2). 


Als 


Hence (p|q)(q|p) = (— 1)"*", and (4) follows at once from the identity 


(p—1)/2 tq (q—1)/2| sp zu ES 1 
6 [s] + [2 adizila i 
) à p E^ q 2 2 
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To prove (5) consider the function 
S(x, y) = qx — py. 


If x and y are nonzero integers then f(x, y) is a nonzero integer. Moreover, 
as x takes the values 1, 2,..., (p — 1)/2 and y takes the values 1.253 
(q — 1)/2 then f(x, y) takes 


values, no two of which are equal since 


IY -= f( y) = f - x y y) 40. 
Now we count the number of values of f(x, y) which are positive and the 
number which are negative. 


For each fixed x we have f(x, y) > 0 if and only if y « qx/p, or y< 
[a7/p]. Hence the total number of positive values is 


X[£] 
x=1 LP 


Similarly, the number of negative values is 


"rp 
»=1ı L4 
Since the number of positive and negative values together is 


pE 

2 2 
this proves (5) and hence (4). o 
Note. The reader may find it instructive to in 


r terpret the foregoing proof 
of (5) geometrically, using lattice points in the pl 


ane. 


At least 150 proofs of the quadratic reciprocity law have been published. 
Gauss himself supplied no less than eight, including a version of the one 


just given. A short Proof of the quadratic reciprocity law is described in an 
article by M. Gerstenhaber [25]. 


9.6 Applications of the reciprocity law 


The following examples show ho 
used to solve the two basic types 
residues. 


w the quadratic reciprocity law can be 
of problems in the theory of quadratic 


EXAMPLE | Determine whether 21 


9 is a quadratic Tesidue or nonresidue 
mod 383. 
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Solution 

We evaluate the Legendre symbol (219/383) by using the multiplicative 
property, the reciprocity law, periodicity, and the special values (— 1|p) and 
(2|p) calculated earlier. 

Since 219 = 3-73 the multiplicative property implies 

(219|383) = (3|383)(73383). 

Using the reciprocity law and periodicity we have 

(3/383) = (383|3)(— 18837 9-44 = _ (13) = —(—1)8- 17 = 1, 
and 

(73/383) = (383|73)( — 1)853- 093- 1/4 = (1873) = (2|73)(9|73) 
z(- 1)737- 8 .—1 


Hence (219|383) = 1 so 219 isa quadratic residue mod 383. 


EXAMPLE 2 Determine those odd primes p for which 3 is a quadratic residue 
_ and those for which it is a nonresidue. 


Solution 
Again, by the reciprocity law we have 
Glp) = (p13)(— IP" 98-4 = (— pe-172(5|3). 
To determine (p|3) we need to know the value of p mod 3, and to determine 
(—1)~? we need to know the value of (p — 1)/2 mod 2, or the value of p 
mod 4. Hence we consider p mod 12. There are only four cases to consider, 
p =1,5,7, or 11 (mod 12), the others being excluded since p is odd. 

Case 1. p = 1 (mod 12). In this case p = 1 (mod 3) so (p|3) = (1|3) = 1. 
Also p = 1 (mod 4) so (p — 1)/2 is even, hence (3|p) = 1. 

Case 2. p = 5 (mod 12). In this case p = 2 (mod 3) so (pl3) = (213) 
(—1) 79/8 — —1. Again, (p — 1)/2 is even since p = 1 (mod 4), so (3|p) 
—1. 

Case 3. p = 7 (mod 12). In this case p = 1 (mod 3), so (p|3) = (113) = 1. 
Also (p — 1)/2 is odd since p = 3 (mod 4), hence (3|p) = — 1. 

Case 4. p = 11 (mod 12). In this case p = 2 (mod 3) so (p|3) = (213) = 
— 1. Again (p — 1)/2 is odd since p = 3 (mod 4), hence (3|p) = 1. 

Summarizing the results of the four cases we find 


3Rp if p = +1 (mod 12) 
3Rp if p = +5 (mod 12). 


9.7 The Jacobi symbol 


To determine if a composite number is a quadratic residue or nonresidue 
mod p it is necessary to consider several cases depending on the quadratic 
character of the factors. Some calculations can be simplified by using an 
extension of Legendre’s symbol introduced by Jacobi. 
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Definition If P is a positive odd integer with prime factorization 
the Jzcobi symbol (n| P) is defined for all integers n by the equation 
© P) = T] enpi”, 
where (n|p;) is the Legendre symbol. We also define (n| 1) = 1. 


The possible values of (n|P) are 1, —1, or 0, with (n| P) = 0 if and only if 
(n, P)> 1. 
If the congruence 
? = n (mod P) 


has a solution then (n|p) = 1 for each prime p; in (6), and hence (n|P) = 1. 
However, the converse is not true since (n| P) can be 1 if an even number of 
factors — 1 appears in (6). 

The reader can verify that the following properties of the Jacobi Symbol 
are easily deduced from properties of the Legendre symbol, 


Theorem 9.9 If P and Q are odd positive integers, we have 


(a) (m|P)(n|P) = (mn|P), 

(b) (n|P)(n|Q) = (n| PQ), 

(c) (m|P) 2 (n|P) whenever m = n (mod P), 
(d) (a?n|P) = (n|P) whenever (a, P) = 1. 


The special formulas for evaluating the Legendre symbols (—1|p) and 
(2|p) also hold for the Jacobi symbol. 


Theorem 9.10 If P is an odd positive integer we have 


(7) (-11P) = (— 18-12 
and 
(8) QIP) = (— 1) 


Proor. Write P = DiP2 ``- Pm Where the prime factors p, 


i : are not necessarily 
distinct. This can also be written as 


Pe ITE seg ge nem - go caua 
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But each factor p; — 1 is even so each sum after the first is divisible by 4. 
Hence 


P214 Sar 1) (mod 4), 
i=1 


or 
1 Lil 
5(P-12 } > (p:— 1) (mod 2) 
2 i312 

Therefore 

C15) = I[C 1p) = [TC n7 = (yr, 

i=1 i=1 

which proves (7). 


To prove (8) we write 
P= Ta *tp-0)-14 Lor -D+ De- DPP- D+. 
Since p; is odd we have p? — 1 = 0 (mod 8) so 
P=1+ eo 1) (mod 64) 
hence L 
HS —-1)- E HS — 1) (mod 8). 
This also holds mod 2, hence 
21P) = fiel) = fJ ner» = (ayo, 
which proves (8). L1 


Theorem 9.11 Reciprocity law for Jacobi symbols. If P and Q are positive odd 
integers with (P, Q) — 1, then 


(P|Q)(Q|P) = (—1y?- 0-14 


Pnoor. Write P = p, --- Pm, Q = q1 -+ - qn, where the D; and q; are primes. 
Then 


(PIQXQIP) = TI Tiela = 1y, 


say. Applying the quadratic reciprocity law to each factor we find that 
USE 1 el Sa 
r= AP ANA N a E m. — 
à 230-7 056-1) PE DE 3 - 1. 
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In the proof of Theorem 9.10 we showed that 
2. le- =} (P — 1) (mod 2), 
i312 2 


and a corresponding congruence holds for X Xa; — 1). Therefore 


_P-1Q-1 
r= —~z (mod 2), 


which completes the proof. DJ 
EXAMPLE 1 Determine whether 888 is a quadratic residue or nonresidue of 
the prime 1999. 


Solution 
^. We have 


(888|1999) = (4|1999)(2| 1999)(111|1999) = (111| 1999). 
To calculate (111| 1999) using Legendre symbols we would write 
(111|1999) — (3]1999)(37| 1999) 


and apply the quadratic reciprocity law to each factor on the right. The 
calculation is simpler with Jacobi symbols since we have 


(11111999) = —(1999}111) = —(11111) = —1. 


Therefore 888 is a quadratic nonresidue of 1999, 


EXAMPLE 2 Determine whether — 1 


04 is a quadratic residue or nonresidue of 
the prime 997, 


Solution 1 
Since 104 — 2.4.13 we have 
(—104|997) = (—1]997)(2|997)(13|997) = —(13|997) 
| = — (997/13) = —(9|13) = —1. 
Therefore — 104 is a quadratic nonresidue of 997. 


9.8 Applications to Diophantine equations 


Equations to be solved in integers are called Diop 


hantine equations after 
Diophantus of Alexandria. An example is the equati 


ion 
9) =x +k 


where k is a given integer. The problem is to decide, 
or not the equation has integer solutions x, y and, 
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for a given k, whether 
if so, to exhibit all of them. 


9.8: Applications to Diophantine equations 


We discuss this equation here partly because it has a long history, going 
back to the seventeenth century, and partly because some cases can be 
treated with the help of quadratic residues. A general theorem states that the 
Diophantine equation 


y = f(x) 

has at most a finite number of solutions if J (x) is a polynomial of degree 
23 with integer coefficients and with distinct zeros. (See Theorem 4-18 in 
LeVeque [44], Vol. 2.) However, no method is known for determining the 
solutions (or even the number of solutions) except for very special cases. 
The next theorem describes an infinite set of values of k for which (9) has 
no solutions. 
Theorem 9.12 The Diophantine equation 

(10) y-2x4k 

has no solutions if k has the form 

(11) k = (4n — 1? — 4m?, 

where m and n are integers such that no prime p = —1 (mod 4) divides m. 


Proor. We assume a solution x, y exists and obtain a contradiction by 


considering the equation modulo 4. Since k = —1 (mod 4) we have 

(12) y? =x? — 1 (mod 4). 

Now y? = 0 or 1 (mod 4) for every y, so (12) cannot be satisfied if x is even 

orifx = —1 (mod 4). Therefore we must have x = 1 (mod 4). Now let 
a=4n-1 

so that k = a? — 4m?, and write (10) in the form 

(13) y! + 4m? = x? + a? = (x + a)? — ax + a’), 

Since x = 1 (mod 4) and a = —1 (mod 4) we have 

(14) x? —ax- a! z1—a- a! = —1 (mod 4), 


Hence x^ — ax +a? is odd, and (14) shows that all its prime factors 
cannot be =1 (mod 4) Therefore some prime p= —1 (mod 4) divides 
x? — ax + a?, and (13) shows that this also divides y? + 4m?. In other words, 


(15) y? = —4m? (mod p) forsome p = —1 (mod 4). 
But p ¥ m by hypothesis, so (—4m?|p) = (-1 lp) = —1, contradicting (15), 


This proves that the Diophantine equation (10) has no solutions when k has 
the form (11). D 
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The following table gives some values of k covered by Theorem 9.12. 


n 0 0 0 OF et 1 1 2 2 2 2 
mite el 2 4 5 x 2 4 2 ue o 5 
k —5 -17 —65 —100 23 11 —37 -73 339 327 279 243 
Note. All solutions of (10) have been calculated when k is in the interval 


—100 < k < 100. (See reference [32].) No solutions exist for the following 
positive values of k « 100: 


k = 6,7, 11, 13, 14, 20, 21, 23, 29, 32, 34, 39, 42, 45, 46, 47, 51, 53, 58, 
59, 60, 61, 62, 66, 67, 69, 70, 74, 75, 77, 78, 83, 84, 85, 86, 87, 88, 90, 
93, 95, 96. 


9.9 Gauss sums and the quadratic 

reciprocity law 
This section gives another proof of the quadratic reciprocity law with the 
help of the Gauss sums 


(16) G(n,y) - Y, x(re?"rin. 


rmodp 


where x(r) = (r|p) is the quadratic character mod p. Since the modulus is 
prime, y is a primitive character and we have the separability property 


(17) G(n, x) = tip)G(L, x) 

for every n. Also, Theorem 8.11 implies that |G(1, x)|? = p. The next theorem 
shows that G(1, x)? is +p. 

Theorem 9.13 If p is an odd prime and y(r) = (r|p) we have 


(18) G(1, x)? = (—1|p)p. 
Proor. We have 


p-1p-1 
GL = Y, Y (ripsIpe?rie*sie, 
r=1 s=1 


For each pair r, s there is a unique t mod p such that s = 


tr (mod p), and 
(rlp)(s|p) = (rip)(trip) = (r° |p)(t|p) = (tip). Hence 


gmat Pah P-1  p-1 
GL, x? = Y, Y (pesi «om — È (tlp) Y; errr «orm. 
t=1 r=1 t=1 r=1 
The last sum on r is a geometric sum given by 


r=1 


Y enn +0/p — í =1 ifpy(it+o, 
p-1 ifp|(l4 t) 
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Therefore 
p-2 p-1 
GUL, x = — Yelp) + (» — D(p — 11p = — È (lp) + p(—11p) 


-(-1lpp 
since 27:4 (t|p) = 0. This proves (18). o 
Equation (18) shows that G(1, x} is an integer, so G(1, x)?~! is also an 


integer for every odd q. The next theorem shows that the quadratic reciprocity 
law is connected to the value of this integer modulo q. 


Theorem 9.14 Let p and q be distinct odd primes and let y be the quadratic 
character mod p. Then the quadratic reciprocity law 


(19) (alp) = (— PM (pla) 
is equivalent to the congruence 
(20) G(1, x^! = (q|p) (mod q). 


Proor. From (18) we have 
(21) G(1, yr! = (— 1 Ipfa Ia pa- 1/2 — (— 1ye- 1q- 1/4 pa- 1/2, 
By Euler's criterion we have p^ */? = (p|q) (mod q) so (21) implies 
(22) G(1, x^! = (— NP" 9«-^(p|g) (mod q). 

If (20) holds we obtain 

(alp) = (— 1)” 9" "^(p|g) (mod q) 

which implies (19) since both members are + 1. Conversely, if (19) holds then 
(22) implies (20). 


The next theorem gives an identity which we will use to deduce (20). 


Theorem 9.15 If p and q are distinct odd primes and if y is the quadratic character 
mod p we have 


(23) auo =(alp) X -- X erp. 
rimodp — rgmodp 
riti +rq=q (mod p) 


Proor. The Gauss sum G(n, y) is a periodic function of n with period p. 
The same is true of G(n, y) so we have a finite Fourier expansion 


G(n xy = Y, ameri, 
iod p 


mm 
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where the coefficients are given by 
1 ; 
(24) a,(m) =— Y, G(n, yfe- 2*imein. 
P nmodp 
From the definition of G(n, y) we have 


Gon, = Y (ipee... E (r pjer 
rı mod p ra mod p 


£ » awe (ra pp Tal pje? tt rap 
rimodp rąmodp 


so (24) becomes 
1 A 
a(m—. Aai rlo) Y e ttre, 
P ri modp rq mod p nmodp 


The sum on n is a geometric sum which vanishes unless rj +++ +r, = 
m (mod p), in which case the sum is equal to p. Hence 


(25) afm) = Y - Y (n--rlp. 


rimodp — rqmodp 
irum m (mod p) 


Now we return to (24) and obtain an alternate expression for a,(m). 


Using the separability of G(n, y) and the relation (n|p)? = (n|p) for odd q 
we find 


1 P 
am) — —G(,3y Y (nipe- 28mm = 1 G4, eG, y) 
P nmodp P 


1 
= » S0, 3X(m|p)G(—1, x) = (m|p)G(1, yy 1 
since 


Gil, 3)G(—1, 5) = G(1, NET.» = |G(1, D? = p. 


In other words, G(1, y)?-1 


= (m|p)a,(m). Taking m = q and usi 25 
obtain (23). 4 8 q and using (25) we 


PROOF OF THE RECIPROCITY LAW. 


To deduce the quadratic reciprocity 1 
from (23) it suffices to show that 4 bn ul 


Q6) LU E 6 nli = 1 (modo, 


rimodp — rgmod 


where the summation indices 75 ---, Tq are subject to the restriction 


(27) Tı +++ + ry q (mod p). 
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If all the indices r,, . . . , r4 are congruent to each other mod p, then their 
sum is congruent to qr; for each j = 1, 2, ... , q, so (27) holds if, and only if, 
qr; = q (mod p), 
that is, if, and only ifr; = 1 (mod p) for each j. In this case the corresponding 
summand in (26) is (1|p) = 1. For all other choices of indices satisfying (27) 
there must be at least two incongruent indices among ry, ..., r,. Therefore 
every cyclic permutation of r;,...,r, gives a new solution of (27) which 
contributes the same summand, (r, - -- r,|p). Therefore each such summand 
appears q times and contributes 0 modulo q to the sum. Hence the only 
contribution to the sum in (26) which is nonzero modulo q is (1|p) — 1. This 

completes the proof. 


9.10 The reciprocity law for quadratic Gauss 
sums 


This section describes another proof of the quadratic reciprocity law based 
on the quadratic Gauss sums 


(28) G(n; m) = y etinm, 


r=1 
If p is an odd prime and p J' n we have the formula 
(29) G(n; p) = (nlp)G(1; p) 
which reduces the study of the sums G(n; p) to the case n — 1. Equation (29) 
follows easily from (28) or by noting that G(n; p) = G(n, x), where x(n) = (n|p), 
and observing that G(n, y) is separable. 

Although each term of the sum G(1; p) has absolute value 1, the sum 
itself has absolute value 0, Jp or ,/2p. In fact, Gauss proved the remarkable 
formula 

m ifm = 1 (mod 4) 

EN , -abua JO ifm = 2 (mod 4) 

e GU; m) = 5 /mQ + DU +e )- i /m if m = 3 (mod 4) 
(14 i/m ifm 0 (mod 4) 


for every m > 1. A number of different proofs of (30) are known. We will 
deduce (30) by treating a related sum 


m-1 
S(a, m) E Ss ean 
r=0 


where a and m are positive integers. If a = 2, then S(2, m) = G(1; m). 

The sums S(a, m) enjoy a reciprocity law (stated below in Theorem 9.16) 
which implies Gauss’ formula (30) and also leads to another proof of the 
quadratic reciprocity law. 
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Theorem 9.16 If the product ma is even, we have 


1 a T 
G1) S(a, m) = E EE Som, a), 


where the bar denotes the complex conjugate. 


Note. To deduce Gauss’ formula (30) we take a = 2 in (31) and observe 
that S(m, 2) = 1 + e^". 


Pnoor. This proof is based on residue calculus. Let g be the function defined 
by the equation 


m-1 
(32) g(z) = X griatz * rm. 
r=0 
Then g is analytic everywhere, and g(0) = S(a, m). Since ma is even we find 


a-1 
g(z + 1) — g(z) = e" Im(o2uios _ 1) = erierim(o2uis L i) Y e2ninz, 
n=0 


Now define f by the equation 


__ 9) 
f(@) = EE 
Then f is analytic everywhere exce 


pt for a first-order pole at each integer, 
and f satisfies the equation 


(33) f * 1) = f(z) + o(z), 

where 

(34) Q(z) = e"m S e2Rinz, 
n=0 


The function o is analytic everywhere. 
At z = 0 the residue of f is 9(0)/(2xi) and hence 


(35) S(a, m) = g(0) = 2zi Res f(z) = Í f(z) dz, 
z-0 " 


where y is any positively oriented simple closed path whose graph contains 
only the pole z = 0 ini 


) in its interior region. We will choose y so that it describes 
a parallelogram with vertices A, A + 1, B + 1, B where 


1 i 1 
AS = = mi 4 = ue ri/4 
2 Re"'^ ang B 2 TRIPS, 
as shown in Figure 9.1. Integrating f along y we have 
A1 B1 B A 
n = IF het I f+ i f 
Y A A*1 B*1 B 
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Figure 9.1 


In the integral [£5 f we make the change of variable w = z + 1 and then 
use (33) to get 


B1 B B B 
f fo) dw | fle+ Dac | fdz + [ oaz 
A*1 A A A 
Therefore (35) becomes 
B A1 B+ 
(36) Same J oz) dz | IOA Í f) dz. 
A A B 
Now we show that the integrals along the horizontal segments from A to 


A + 1 and from B to B + 1 tend to Oas R > + œ. To do this we estimate the 
integrand on these segments. We write 


—.. lel 
(37) fO = vai 1p 


and estimate the numerator and denominator separately. 
On the segment joining B to B 4- 1 we let 


y(t) = t + Re^, where — 5 Sts T 
From (32) we find 
moi nia(t + Re™"!* + p? 
(38) IDOJ < bl ep. ce 


where exp z = e. The expression in braces has real part 


—na(/2tR + R?  /2rR) 
m 
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Since |e**| = e* and exp( na /2rR/m) < 1, each term in (38) has absolute 
value not exceeding exp{ — zaR?/m)exp( — v/PnatR[m). But — 1/2 < t < 1/2, 
so we obtain the estimate 

lal] < me728I2mg - zaR?m, 


For the denominator in (37) we use the triangle inequality in the form 


ferrets le2"*| — 1), 


Since |exp{2ziy(t)}| = exp{—22R sin(x/4)} = exp( —,/2nR}, we find 
Jerri w^ 1| sH- e V2nR 
Therefore on the line segment joining B to B + 1 we have the estimate 
me" 2aRIQ m), - xaR2/m 


If) S a ee Ak — =al) as R > +00, 


A similar argument shows that the integrand tends to 0 on the segment 
joining A to A + 1 as R > +00. Since the length of the path of integration 
is 1 in each case, this shows that the second and third integrals on the right 
of (36) tend to 0 as R — +o. Therefore we can write (36) in the form 


B 
(39) S(a, m) = f 9(z)dz + o(1) as R> +o. 
A 


To deal with the integral Ji € we apply Cauch 
9? around the parallelogram with vertices A, B, a, 
Re". (See Figure 92)S 
parallelogram is 0, so 


(40) f + fo + [> + f'? =0. 


y's theorem, integrating 
—, where a = B + 4 = 
ince ¢ is analytic everywhere, its integral around this 


Figure 9.2 
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Because of the exponential factor e7**"/" in (34), an argument similar to that 
given above shows that the integral of o along each horizontal segment ^ 0 
as R — + œ. Therefore (40) gives us 


B 
[27 fiero as R > 4- oo, 
A -a 
and (39) becomes 
(41) Sam) = f. ole) dz + ofl) as Ro +o, 


where « = Re™/*, Using (34) we find 
2 


a-1 
f q(z) dz = S: ries Imo2ninz Jz = y e rimrlap(, m, n, R), 
-a n=0 


-a n=0 


l(a,m, n, R) = it ES (24% + =y je 


Applying Cauchy's theorem again to the parallelogram with vertices —a, o, 
a — (nm/a), and —« — (nm/a), we find as before that the integrals along the 
horizontal segments > 0 as R > +00, so 


a— mnja 5 2 
I(a, m, n, R) = j e [e (: + =) has +o(1) as R > +0. 


-a- nma 


where 


The change of variable w = ./a/m(z + (nm/a)) puts this into the form 


I(a, m, n, R) = E: e"™? dw + o(1) as R-» +00. 


Letting R — +00 in (41), we find 


a-1 R /a]met/4 
(42) S(a, m) = Y; e^ "imis ft lim f ei"? dw, 
n=0 a Jim f o aste 
By writing T = ./a/mR, we see that the last limit is equal to 
Tezi/4 
lim e dw = I 


To Y- Te*i/^ 


say, where J is a number independent of a and m. Therefore (42) gives us 


(43) S(a, m) = [ss a). 
To evaluate J we take a = 1 and m = 2 in (43). Then S(1,2) = 1 + i and 
S(2, 1) = 1, so (43) implies I = (1 + iJ /2, and (43) reduces to (31). o 
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Theorem 9.16 implies a reciprocity law for quadratic Gauss sums. 
Theorem 9.17 If h > 0, k > 0, h odd, then 


(44) Gh; k) = fee (1 + e7 Py: 


Proor. Take a = 2h, m = k in Theorem 9.16 to obtain 


A + 57h! 


(45) Gih; k) = S(2h, k) = ha 2° 


ki+i 

S(k, 2h) = 
z 7; 28 
We split the sum on r into two parts corresponding to even and odd r. 
Forevenr we writer — 2s wheres — 0,1, 2,...,h — 1. Foroddr we note that 
(r + 2h)? = r? (mod 4h) so the sum can be extended over the odd numbers 
in any complete residue system mod 2h. We sum over the odd numbers in the 
interval h < r < 3h, writing 7 = 2s + h, where s = 0,1,2,..., h — 1. (The 
numbers 2s + h are odd and distinct mod 2h.) This gives us 

2h-1 5 h-1 S h-1 à 
e re JQh) — e r*Qs) (2h) + g^ Tiks +h) (2h) 

p X p 


—nikr?/(2h) 


h-1 
= Dren ai + CE 
s=0 


(1 + e "yg h). 
Using this in (45) we obtain (44). 


9.11 Another proof of the quadratic 
reciprocity law 


Gauss' formula (30) leads to a 
First we note that (30) implies 


G(1; k) = i 1I. k 
if k is odd. Also, we have the multiplicative property (see Exercise 8.16(a)) 
G(m; n)G(n; m) = G(1;mn) if(m,n) = 1. 
Therefore, if p and q are distinct odd primes we have 


G(p; q) = (pla)G(1; q) = (plg)i«- q 
Ga; p) = (alp)G(1; p) = (g|pye- vis. /5 


quick proof of the quadratic reciprocity law. 


and 
G(p; q)G(a; p) = G(1; pq) = itr«- 1^ /pg, 
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Comparing the last equation with the previous two we find 


plaa Ipii Ka-1) *(—-1)?)/4 — j(2— 17/4, 


and the quadratic reciprocity law follows by observing that 


jm 13-(-1)-(-1)9y4 — (= 1e- 1a- 1/4 o 


Exercises for Chapter 9 


2 


Determine those odd primes p for which (—3|p) = 1 and those for which (—3|p) = 
—1. 


Prove that 5 is a quadratic residue of an odd prime p if p = + 1 (mod 10), and that 
5 is a nonresidue if p = +3 (mod 10). 


. Let p be an odd prime. Assume that the set (1,2, ..., p — 1) can be expressed as the 


union of two nonempty subsets S and T, S # T, such that the product (mod p) of any 
two elements in the same subset lies in S, whereas the product (mod p) of any 
element in S with any element in T lies in T. Prove that S consists of the quadratic 
residues and T of the nonresidues mod p. 


Let f (x) be a polynomial which takes integer values when x is an integer. 
(a) If a and b are integers, prove that 


Y, (lax + b)lp = Y (69lp if(a, p) = 1, 

xmodp xmodp 

and that 

Y. (af(x)lp) = (alp) Y, (69Ip) for all a. 
P 


x mod xmodp 
(b) Prove that 
Y (x +d|p)=0 if@p)= i 
xmodp 


(c) Let f(x) = x(ax + b), where (a, p) = (b, p) = 1. Prove that 


p-1 p-1 
X FOP) = Za + bx|p) = — (alp). 
x=1 x= 


[Hint: As x runs through a reduced residue system mod p, so does x’, the 
reciprocal of x mod p.] 


5. Let and f) be integers whose possible values are + 1. Let N(a, f) denote the number 


of integers x among 1,2,..., P — 2 such that 
(xlp 2 «  and(x + 1|p)= $, 


where p is an odd prime. Prove that 


-2 
4N(a, B) = * {1 + a(x|p)}{1 + B(x + 101p. 
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and use Exercise 4 to deduce that 


Na B) = p — 2 — B — aB — o(~1p). 


In particular this gives 
ear 
N(1, 1) = eet 


N(-1, -1) = N-1,1 = 72 + (=p) 
" , 
N(l, -1) 214 N(, 1). 
y prime p there exist integers x and y such that 
x? + y? +1 = 0 (mod p). 
7. Let p be an odd prime. Prove each of the following statements 


p-1 
(à) Yrrlp 0 ifp=1 (mod 4). 
r=1 


p-i z 
( Y [ER D ifp =! (mod 4), 


r= 
(rip=1 


RE p-1 

(c) Y rep) 7 PYr(rlp) ifp=3 (mod 4). 
r= r=1 
p-1 3 P-1 

(d) X rep =5P Llp) ifp=1 (mod 4). 
r= r=1 


p-1 p-1 p-1 
(e) yelp) 7 2p Y rp) - p? Y, rrlp) ifp =3 (mod 4). 
rz r=1 r=1 


(Hint: p — r runs through us numbers 1,2, ..., P — 1 with r.] 


8. Let p be an odd prime, p 2 3 Gand 4), and let q = (p — 1y2. 


(a) Prove that 


— B. 


r=1 


1 = X210} Y reip) = p —e T2093 
r=1 


[Hint: As r runs ien the e numbers La 
th = 
through the numbers 1, P — 1, as do 2r eer pa Ta e —— 
(b) Prove that 


Pet 
(10 - 2} X ri = pS elp 
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9. If p is an odd prime, let x(n) = (n|p). Prove that the Gauss sum G(n, y) associated 
with y is the same as the quadratic Gauss sum G(n; p) introduced in Exercise 8.16 
if (n, p) = 1. In other words, if p ¥ n we have 


Gln, ) = xm = Y etsi = Gf; p). 
mmod p r=1 


It should be noted that G(n, x) # G(n; p) if p|n because G(p, y) = 0 but G(p; p) = p- 


10. Evaluate the quadratic Gauss sum G(2; p) using one of the reciprocity laws. Com- 
pare the result with the formula G(2; p) = (2|p)G(1; p) and deduce that (2|p) = 
(— 197 8 if p is an odd prime. 
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1 O Primitive Roots 


10.1 The exponent of a number mod m. 
Primitive roots 


Let a and m be relatively prime inte, 


gers, with m > 1, and consider all the 
positive powers of a: 


Qa ana 


We know, from the Euler-Fermat theorem, that a” = 1 (mod m). How- 
ever, there may be an earlier power a^ such that af = 1 (mod m). We are 
interested in the smallest positive f with this property. 


Definition The smallest positive integer f such that 


a! = 1 (mod m) 


is called the exponent of a modulo m, and is denoted by writing 


f = exp,,(a). 


If exp,(a) = (m) then a is called a primitive root mod m. 


The Euler-Fermat theorem tells us that €Xp»(a) < (m). The next theorem 
Shows that exp, (a) divides gm). 


Theorem 10.1 Given m > 1, (a, m) = 1, let f = exp,(a). Then we have: 


(a) a‘ = a" (mod m) if, and only if, k = h (mod f). 
(b) à* 2 1 (mod m) if, and only if, k = 0 (mod f). In particular, f |g(m)- 
(c) The numbers 1, a, a?, . .. , a! ^! are incongruent mod m. 
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10.2: Primitive roots and reduced residue systems 


Pnoor. Parts (b) and (c) follow at once from (a), so we need only prove (a). 
If a* = a^ (mod m) then a*^^ = 1 (mod m). Write 


k—h-qf-*r, where0 &r « f. 


Then 1 = a*^^ = af +" =a" (mod m), sor = O and k = h (mod VE 
Conversely, if k =h (mod f) then k — h = qf so a^" = 1 (mod m) 
and hence a‘ = a" (mod m). o 


10.2 Primitive roots and reduced residue 
systems 


Theorem 10.2 Let (a, m) = 1. Then a is a primitive root mod m if, and only if, 
the numbers i 


(1) a, a?, ..., a? 
form a reduced residue system mod m. 


Pnoor. If a is a primitive root the numbers in (1) are incongruent mod m, by 
Theorem 10.1(c). Since there are (m) such numbers they form a reduced 
residue system mod m. 

Conversely, if the numbers in (1) form a reduced residue system, then 
a?™ = 1 (mod m) but no smaller power is congruent to 1, so ais a primitive 


root. 


Note. In Chapter 6 we found that the reduced residue classes mod m form 
a group. If m has a primitive root a, Theorem 10.2 shows that this group is 
the cyclic group generated by the residue class â. 


The importance of primitive roots is explained by Theorem 10.2. If m 
hasa primitive root then each reduced residue system mod m can be expressed 
as a geometric progression. This gives a powerful tool that can be used in 
problems involving reduced residue systems. Unfortunately, not all moduli 
have primitive roots. In the next few sections we will prove that primitive 
roots exist only for the following moduli: 


m = 1,2, 4, p°, and 2p*, 


where p is an odd prime and « > 1. : 
The first three cases are easily settled. The case m — 1 is trivial. For m — 2 


the number 1 is a primitive root. For m = 4 we have @(4) = 2 and pes 
1 (mod 4), so 3 is a primitive root. Next we show that there are no primitive 
roots mod 2* if « > 3. 
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10.3 The nonexistence of primitive roots 
mod 2* for « > 3 


Theorem 10.3 Let x be an odd integer. If a > 3 we have 
(2) x*9?? = 1 (mod 29, 
so there are no primitive roots mod 2. 


PROOF. If « = 3 congruence (2) states that x? = 1 (mod 8) for x odd. This is 
easily verified by testing x = 1, 3, 5, 7 or by noting that 


(2k + 1)? = 4k? + Ak + 1 = Ak 1) 4.1 
and observing that k(k + 1) is even. 


Now we prove the theorem by induction on €. We assume (2) holds for 
æ and prove that it also holds for æ + 1. The induction hypothesis is that 


xem Lu 
where t is an integer. Squaring both sides we obtain 
XP) = 1 261p 4 92452 — 1 (mod 2** !) 
because2a > x + 1. This completes the proof since (25) = 27-1 = e(2** y2 


10.4 The existence of primitive roots mod p 
for odd primes p 


First we prove the following lemma. 
Lemma 1 Given (a, m) = 1, let f = exp,,(a). Then 


exp,(a) 
(f 

exp,,(a) if, and only if, (k, f) = 1. 
Pnoor. The exponent of a* is the smallest posit 


exp, (a*) = 
In particular, exp,,(a") = 
ive x such that 
a* = 1 (mod m). 


This is also the smallest x > 0 such that kx = 
congruence is equivalent to the congruence 


x=0 (moa Ji 


where d = (k, f). The smallest positive Solution of this congruence is f. fa 


0 (mod f). But this latter 


so exp,,(a") = f/d, as asserted. 
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Lemma 1 will be used to prove tne existence of primitive roots for prime 
moduli. In fact, we shall determine the exact number of primitive roots 
mod p. 


Theorem 10.4 Let p be an odd prime and let d be any positive divisor of p — 1. 
Then in every reduced residue system mod p there are exactly (d) numbers 
a such that 


exp,(a) = d. 


In particular, when d = q(p) = p — 1 there are exactly (p — 1) primitive 
roots mod p. 


Proor. We use the method employed in Chapter 2 to prove the relation 
È gld) =n. 
d|n 
The numbers 1, 2, ..., p — 1 are distributed into disjoint sets A(d), each set 
corresponding to a divisor d of p — 1. Here we define 
A(d) = (x:1 < x < p — 1 and exp,(x) = d). 


Let f (d) be the number of elements in A(d). Then f(d) = 0 for each d. Our 
goal is to prove that f (d) = o(d). 

Since the sets A(d) are disjoint and since each x = 1,2, ..., p — 1 falls into 
some A(d), we have 


Y f(a) =p -1. 
d|p- 1 
But we also have 
Y o0)-p-1 
d|p- 1 


so 
(0) - f) = 0. 


To show each term in this sum is zero it suffices to prove that f (d) < (d). 
We do this by showing that either f (d) = 0 or f(d) = o(d); or, in other words, 
that f(d) # O implies f(d) = o(d) — — 
Suppose that f(d) # 0. Then A(d) is nonempty so a € A(d) for some a. 
Therefore 
exp,(a) =d, hence a! = 1 (mod p). 
But every power of a satisfies the same congruence, so the d numbers 


(3) 2,3, <a" 
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are solutions of the polynomial congruence 

(4) x4 — 1 = 0 (mod p), 

these solutions being incongruent mod p since d — exp,(a). But (4) has at 
most d solutions since the modulus is prime, so the d numbers in (3) must be 
all the solutions of (4). Hence each number in A(d) must be of the form a 
for some k = 1,2,..., d. When is exp,(a‘) = d? According to Lemma 1 this 
occurs if, and only if, (k, d) = 1. In other words, among the d numbers in (3) 


there are q(d) which have exponent d modulo p. Thus we have shown that 
f(d) = o(d) if f(d) # 0. As noted earlier, this completes the proof. Im 


10.5 Primitive roots and quadratic residues 


Theorem 10.5 Let g be a primitive root mod p, 


where p is an odd prime. Then 
the even powers 


GAGA. digan 
are the quadratic residues mod p, and the odd powers 


9.8... g^? 
are the quadratic nonresidues mod p. 


Poor. If n is even, say n = 2m, then g^ = (g")? so 
g" = x (mod p), where x = g". 


Hence g"Rp. But there are (p — 1)/2 distinct even powers g^...,g^! 
modulo p and the same number of quadratic residues mod p. Therefore the 


€ven powers are.the quadratic residues and the odd powers are the non- 
residues. oO 


10.6 The existence of primitive roots mod p 


We turn next to the case m — p*, where p is an odd prime and « > 2. In 
seeking primitive roots mod p* it is natural to consider as candidates the 
primitive roots mod p. Let g be such a primitive root and let us ask whether 
g might also be a primitive root mod p?. Now "^! =1 (mod p) and, since 
9(p^) = p(p — 1) > p — 1, this g will certainly not be a primitive root 
mod p? if g^^! = 1 (mod p°). Therefore the relation 


9"! #1 (mod p?) 


is à necessary condition for a primitive root g mod p to also be a primitive 
root mod p?. Remarkably enqugh, this condition is also sufficient for g 
to be a primitive root mod p and, more generally, mod p* for all powers 
* > 2. In fact, we have the following theorem. 
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10.6: The existence of primitive roots mod p* 


Theorem 10.6 Let p be an odd prime. Then we have: 
(a) Ifg is a primitive root mod p then g is also a primitive root mod p* for 
all « > 1 if, and only if, 
(5) g=! #1 (mod p?) 
(b) There is at least one primitive root g mod p which satisfies (5), hence 
there exists at least one primitive root mod p* if a > 2. 


Pnoor. We prove (b) first. Let g bea primitive root mod p.Ifg?~! # 1 (mod p?) 
there is nothing to prove. However, if g^^! = 1 (mod p?) we can show that 
gı =g + p, which is another primitive root modulo p, satisfies the condition 
g:”"1 #1 (mod p°). 
In fact, we have 
g, 1 = (g + pt gi (p — Dg p tp? 

= g^! (p° — pg"? (mod p°) 

= 1 — pg^^? (mod p°). 
But we cannot have pg’~? = 0 (mod p?) for this would imply g’ = 
0 (mod p), contradicting the fact that g is a primitive root mod p. Hence 
gi?! # 1 (mod p°), so (b) is proved. 

Now we prove (a). Let g be a primitive root modulo p. If this g is a primitive 
root mod p* for all « > 1 then, in particular, it is a primitive root mod p? and, 
as we have already noted, this implies (5). 

Now we prove the converse statement. Suppose that g is a primitive 
root mod p which satisfies (5). We must show that g is also a primitive 
root mod p* for all a > 2. Let t be the exponent of g modulo p*. We wish 
to show that t = q(p^). Since g' = 1 (mod p^) we also have g' = 1 (mod p)so 
q(p)|t and we can write. 


(6) t = qp). 
Now t|g(p%) so qo(p)|o(p"). But o(p*) = p* *(p — 1) hence 
q(p — 1)|p* p — 1) 
which means q|p*~!. Therefore q = p^ where f < « — 1, and (6) becomes 
t=p(p— 1) 


If we prove that f| = « — 1 then t = ¢(p*) and the proof will be complete. 
Suppose, on the contrary, that f < a — 1. Then f € a — 2 and we have 


t- pp- Dip ep- 1) = op?) 
Thus, since q(p*^ !) is a multiple of t, this implies, 
(7) g^" ? = 1 (mod p°). 
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i hich shows that (7) is a 
w we make use of the following Lemma w 
fo Rude This contradiction will complete the proof of Theorem 10.6. 


o 
Lemma 2 Let g be a primitive root modulo p such that 


(8) g=! #1 (mod p°). 
Then for every « > 2 we have 


(9) 9°" € 1 (mod p°), 


Pnoor or LEMMA 2. We use induction on a. For « = 2, relation (9) reduces 
to (8). Suppose then, that (9) holds for g. By the Euler-Fermat theorem 
we have 


geo» = 1 (mod p!) 
so 


gr) 214 kp*-! 


Where p ¥ k because of 


(9). Raising both sides of this last relation to the pth 
power we find 


g= (0 kp pap 4 kp* + k? we) D PREY 4 ppe- 1) 
Now 2412441 and 3a — 
equation gives us the congruence 

9) = 1 + kp (mod p**!) 
where p ¥ k. In other Words, g?”" Æ | (mod p 


**!) so (9) holds fora + 1 
if it holds for x. This completes the proof of Le 


mma 2 and also of Theorem 
o 
10.7 The existence of primitive roots 


Theorem 10.7 If p is an odd 
g modulo p*. Each such gi: 


mod 2p* 
prime and « > 


there exist odd primitive roots 
s also a primitive 


root modulo 2p*, 


- Let g be an odd primitive 
Xponent of g mod 2p i 
f = 9(2p"). Now f leQp*), and (2p 
1 (mod 2p’) so 9 =1 (mod if " A 
g is a primitive root mod p*. Therefore f = o) = P) hence POIS since 


o 
210 


10.8: The nonexistence of primitive roots in the remaining cases 


10.8 The nonexistence of primitive roots in 
the remaining cases 


Theorem 10.8 Given m > 1 where m is not of the form m = 1, 2, 4, p*, or 2p*, 
where p is an odd prime. Then for any a with (a, m) — 1 we have 


q? 0)? = | (mod m), 
so there-are no primitive roots mod m. 


Pnoor. We have already shown that there are no primitive roots mod 2* 
if « > 3. Therefore we can suppose that m has the factorization 


m = 2p," p 


where the p; are odd primes, s > 1, and « > 0. Since m is not of the form 
1, 2, 4, p* or 2p* we have a > 2ifs = 1 and s > 2 ifa = O or 1. Note that 


(m) = 9(2*)p(P1*) --- pp). 

Now let a be any integer relatively prime to m. We wish to prove that 
q*0)/? = | (mod m). 
Let g be a primitive root mod p,™ and choose k so that 
a = g* (mod p,”). 

Then we have 
(10) aP = gkoim/2 = gir) (mod p, 
where 

t = ke(2")p(p2) -- - 9(p,7)/2. 


We will show that t is an integer. If « > 2 the factor q(2") is even and hence 
t is an integer. If = O or 1 then s > 2 and the factor (p^) is even, so t is an 
integer in this case as well. Hence congruence (10) gives us 


q?7? = | (mod p,^'). 
In the same way we find 
(1) arm’? = 1 (mod p) 


foreach i = 1,2,...,s. Now we show that this congruence also holds mod 22. 
If æ > 3 the condition (a, m) = 1 requires a to be odd and we may apply 
Theorem 10.3 to write 


a2"? = | (mod 2?) 
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Since ¢(2*)| (m) this gives us 
(12) a%™l2 = 1 (mod 2%) 


fora > 3. 
If æ € 2 we have 


(13) a*?? = | (mod 29. 


But s > 1 so y(m) = 9(2°)g(p,") --- s^) = 2ro(2*) where r is an integer. 
Hence 9(2*)| o(m)/2 and (13) implies (12) for « < 2. Hence (12) holds for all a. 
Multiplying together the congruences (11) and (12) we obtain 


a*)? = 1 (mod m), 


and this shows that a cannot be a primitive root mod m. 


10.9 The number of primitive roots mod m 
We have shown that an integer m > 1 has a primitive root if and only if 
m = 1,2, 4, p* or 2p’, 


where p is an odd prime and « > 1. Th 


€ next theorem tells us how many 
primitive roots exist for each such m. 


Theorem 10.9 Ifm has a primitive root g then m has exactl y e (o(m)) incongruent 
primitive roots and the y are given 


by the numbers in the set 


S={g:l<n< €(m), and (n, 9(m)) = 1). 


PRoor. We have *Xp,(g) = o(m), and Lemma 1 shows that €xp,(g^) = exp,,(g) 
if "c only if (n, (m)) = 1. Therefore each element of S is a Primitive root 
mod m. 

Conversely, if a is a primitive root mod m, then a = g* (mod m) for some 
m(g") = exp, (a) = ym), 


Y Primitive root is a 
members mod m the p 


and Lemma 1 implies 
member of S. Since S 
roof is complete. o 
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Table 10.1 g(p) is the smallest primitive root of the prime p 


gp) | P 9(P) 
3 * 109 6 260 2 439 15 617 3 
3.2 13 3 21 6 443 2 619 D 
5 2 127 3 277." 5 49 3 631 3 
7 8 Bi 2 231 3 457 13 641 2 
nu 2 137 3 283 3 4600 2 643 2 
13; 32 139 2 2033 2 463 3 647 1 
17 3 149 2 307-55 467 2 653 2 
19 2 1531 6 311 17 479 13 659 3 
23. 5 ist S 313 10 487 43 661 2 
29 2 163 -2 317 72 401. = 92 673 5 
31 3 1697 5 331 3 499 7 677 2 
37 5 qu 2 337 10 5035 5 683 3 
4 6 79 2 47 2 59 2 691 2 
43 3 181 2 349 2 521 3 701 5 
47 5 191 19 353, 3 523.2 709 2 
53 4 193 5 359 7 S4 2 719 17 
59 2 197 2 367 6 547... 2 TM 7 
61 2 199 3 375° 2 557, 080. 733 3 
67 2 2.0 2 39 2 563 2 739 5 
UT 225 8 383 5 569 3 743 2 
73, 5 22] 2 38 2 S13 3 751 2 
79 3 22 6 397/99 S 511 ^5 751 3 
83 2 233 3 40 3 587 2 761 5 
89 3 239 7 40 21 5903203 769 6 
97 5 A 7 49 2 599 7 773 3 
101 2 251 6 421 2 60 7 787 5 
103 5 257 3 434 7 607 7 
107 2 263 5 43395) 613 


10.10 The index calculus 


If m has a primitive root g the numbers 1, g, g^, ..., g*"?-' form a reduced 
residue system mod m. If (a, m) — 1 there is a unique integer k in the interval 
0 < k < (m) — 1 such that 

a = g* (mod m). 


This integer is called the index of a to the base g (mod m), and we write 
k = ind; a 


or simply k — ind a if the base g is understood. 
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The following theorem shows that indices have properties analogous to 
those of logarithms. The proof is left as an exercise for the reader. 
Theorem 10.10 Let g be a primitive root mod m. If (a, m) = (b, m) = 1 we have: 


a) ind(ab) = ind a + ind b (mod (m). 
is inda" = ninda (mod g(m) if n> 1. 
(c) indi— 0 and indg- 1. 

(d) ind(—1) = o(m)2 if m» 2. 

(e) If g' is also a primitive root mod m then 


ind, a = ind, a - ind, g' (mod ¢(m)). 


Table 10.2 on pp. 216-217 lists indices for all numbers a # 0 (mod p) and 
all odd primes p < 50. The base g is the smallest primitive root of p. 
The following examples illustrate the use of indices in solving congruences. 


EXAMPLE 1 Linear congruences. Assume m has a primitive root and let 
(a, m) = (b, m) = 1. Then the linear congruence 


(14) 1 ax z b (mod m) 
is equivalent to the congruence 
ind a + ind x = ind b (mod ø(m)), 
So the unique solution of (14) satisfies the congruence 
ind x = ind b — inda (mod o(m)). 
To treat a numerical exemple: consider the linear congruence 
9x = 13 (mod 47). 
The corresponding index relation is 
ind x = ind 13 — ind 9 (mod 46). 
From Table 10.2 we find ind 13 = 11 and ind 9 — 40 (for P = 47), so 
ind x = 11 — 40 = -29 = 17 (mod 46). 
Again from Table 10.2 we find x = 38 (mod 47). 


EXAMPLE 2 Binomial congruences. A congruence of the form 
x" = a (mod m) 


is called a binomial congruence. If m has 


cal i a primitive root and if (a,m) = 1 
this is equivalent to the congruence 


nind x = ind a (mod ¢(m)), 
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which is linear in the unknown ind x. As such, it has a solution if, and only if, 

ind a is divisible by d = (n, g(m)), in which case it has exactly d solutions. 
To illustrate with a numerical example, consider the binomial congruence 

(15) x? = a (mod 17). 

The corresponding index relation is 

(16) 8 ind x = ind a (mod 16). 


In this example d = (8, 16) = 8. Table 10.2 shows that 1 and 16 are the only 
numbers mod 17 whose index is divisible by 8. In fact, ind 1 = Oand ind 16 = 
8. Hence (15) has no solutions if a # 1 or a # 16 (mod 17). 

For a — 1 congruence (16) becomes 


(17) 8 ind x = 0 (mod 16), 
and for a — 16 it becomes 
(18) 8 ind x = 8 (mod 16). 


Each of these has exactly eight solutions mod 16. The solutions of (17) are 
those x whose index is even, 


x = 1,2, 4, 8, 9, 13, 15, 16 (mod 17). 


These, of Course, are the quadratic residues of 17. The solutions of (18) are 
those x whose index is odd, the quadratic nonresidues of 17, 


x = 3, 5, 6, 7, 10, 11, 12, 14 (mod 17). 


EXAMPLE 3 Exponential congruences. An exponential congruence is one of 
the form 


a* = b (mod m). 


If m has a primitive root and if (a, m) — (b, m) — 1 this is equivalent to 
the linear congruence 


(19) x ind a = ind b (mod g(m)). 


Let d = (ind a, g(m)). Then (19) has a solution if, and only if, d|ind b, in 
which case there are exactly d solutions. In the numerical example 


(20) 25* = 17 (mod 47) 


We have ind 25 = 2, ind 17 = 16, and d = (2, 46) = 2. Therefore (19) 
omes 


2x = 16 (mod 46), 


with two solutions, x = 8 and 31 (mod 46). These are also the solutions of 
(20) mod 47. 
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Table 10.2 Indices of all numbers a # 0 (mod p) for odd primes p < 50. The base g is the smallest pri 
root of p. 
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10.11 Primitive roots and Dirichlet characters 


Primitive roots and indices can be used to construct explicitly all the Dirichlet 
characters mod m. First we consider a prime power modulus p^, where p is 
an odd prime and a > 1. hs 

Let g be a primitive root mod p which is also a primitive root mod p^ 
for all B > 1. Such a g exists by Theorem 10.6. If (n, p) = 1 let b(n) = ind, n 
(mod p?) so that b(n) is the unique integer satisfying the conditions 


n= (mod p°), 0 < b(n) < o(p*). 
For h = 0, 1,2, ..., o(p") — 1, define y, by the relations 


e rihbin/o(p?) ifp Kn, 
Q1) xn) = k if pin. 


Using the properties of indices it is easy to verify that Xn is completely 
multiplicative and periodic with period p*, so y, is a Dirichlet character 


mod p*, with xo being the principal character. This verification is left as an 
exercise for the reader. 


Since 
nlg) = eaten 


the characters Xo, y,,..., Xop- 1 are distinct because they take distinct 
values at g. Therefore, since there are (p°) such functions they represent 
all the Dirichlet characters mod p*. The same construction works for the 
modulus 2* if a = 1 or « = 2, using g = 3 as the primitive root. 

Now if m = p,“ --- p.*, where the p; are distinct odd primes, and if y; is a 
Dirichlet character mod Di", then the product y = y, --- y, is a Dirichlet 
character mod m. Since o(m) = 9(p1^) -- - o(p,") we get (m) such characters 
as each y, runs through the o(p) characters mod pi”. Thus we have ex- 
plicitly constructed all Characters mod m for every odd modulus m. 


Ifa 23 the modulus 2* has no primitive root and a slightly different 
Construction is needed to 


x obtain the characters mod 2*. The following 
theorem shows that 5 is a good substitute for a primitive root mod 2*. 


T 10.11 As 23.7 hen for every odd integ rn is a uni 
th uniqui 
determined integer b(n) Such that à ere Hd ely 


n= (12S (mod 29, with 1 < b(n) < (2")/2. 
Pnoor. Let f = exp,.(5) so that 5/ = 1 
(2°)/2. Now f |o(2*) = 2-1, so f = 2 fo 


(mod 2"). We will show that f = 
10.8 we know that 


T some f <  — 1. From Theorem 
52 = 1 (mod 25, 
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hence f < ¢(2%)/2 = 2*7?. Therefore f x « — 2. We will show that f = 
x — 2. 

Raise both members of the equation 5 = 1 + 2? to the f = 2? power to 
obtain 


5/ = (1 + 22^ = 1 + 204?  r29*? = 1 + 2°41 + 27) 


where r is an integer. Hence 5/ — 1 = 2°*?t where t is odd. But 2*|(5/ — 1) 
soa<f+2, or B>a—2. Hence f 2a —2 and f = 2*7? = (2/2. 
Therefore the numbers 


Q2) 5,58 EST 


are incongruent mod 2%. Also each is =1 (mod 4) since 5 — 1 (mod 4). 
Similarly, the numbers 


Q3) —5, —5,..., —55 


are incongruent mod 2? and each is = 3 (mod 4) since —5 = 3 (mod 4). 
There are 2f = (2%) numbers in (22) and (23) together. Moreover, we cannot 
have 5¢= —5^ (mod 2%) because this would imply 1 = —1 (mod 4). 
Hence the numbers in (22) together with those in (23) represent (2°) in- 
congruent odd numbers mod 2%. Each odd n = 1 (mod 4) is congruent 
mod 2* to one of the numbers in (22), and each odd n = 3 (mod 4) is con- 
Bruent to one in (23). This proves the theorem. O 


With the help of Theorem 10.11 we can construct all the characters 
mod 2* ifa > 3. Let 


(—1)"-)? ifn is odd, 
(24) fin) = li if n is even, 
and let 


e2nibin2*-? if n is odd, 
g(n) = lo if n is even, 


Where b(n) is the integer given by Theorem 10.11. Then it is easy to verify 
that each of f and g is a character mod 2*. So is each product 


Q5) x, dn) = f ntn 


where a = 1,2 and c = 1,2,..., 9(2%)/2. Moreover these ¢(2*) characters are 
distinct so they represent all the characters mod 2*. 

Now if m = 2*Q where Q is odd, we form the products x = y,y; where 
X1 runs through the g(2*) characters mod 2° and y; runs through the «(Q) 
characters mod Q to obtain all the characters mod m. 
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10.12 Real-valued Dirichlet characters 
mod p* 


If x is a real-valued Dirichlet character mod m and (n, m) = 1, the number 

(n) is both a root of unity and real, so y(n) = +1. From the construction 
n the foregoing section we can determine all real Dirichlet characters 
mod p*. 


Theorem 10.12 For an odd prime p and « > 1, consider the q(p*) Dirichlet 
characters y, mod p* given by (21). Then x, is real if, and only if, h = 0 
or h = q(p^)/2. Hence there are exactly two real characters mod p. 


Proor. We have e"* = +1 if, and only if, z is an integer. If p ¥ n we have 


xn) = e rihb(n) op?) 


so x,(n) = +1 if, and only if, o(p?)|2hb(n). This condition is satisfied for all 
nif h = 0 or if h = q(p*)/2. Conversely, if 9(p*)|2hb(n) for all n then when 
b(n) = 1 we have q(p?|2h or q(p?)/2|h. Hence h = 0 or h = (p*)/2 since 
these are the only multiples of ¢(p°)/2 less than (p^). O 


Note. The character corresponding to h = 0 is the principal character. 
When a = 1 the quadratic character x(n) = (n|p) is the only other real 
character mod p. 


For the moduli m = 1, 2 and 4, all the Dirichlet characters are real. 
The next theorem describes the real characters mod 2* when « > 3. 


Theorem 10.13 If « > 3, consider the q(2") Dirichlet characters y, , mod 2? 
given by (25). Then y. is real if, and only if, c = 9(2")/2 orc = 9(2*)/4. 
Hence there are exactly four real characters mod 2* ifa 23. 


Proor. If a > 3 and n is odd we have, by (25), 


Xs, dn) = f (ngin) 
where f(n) = +1 and 


ü a-2 
g(ny E e2nicb(n)/2 4 


with 1 < c < 2-2, This is +1 if, and only if, 2*-?|2cb(n), or 2*7? |cb(n). 
Since o(2*) = 27-1 this condition is satisfied if c — (25/2 = 2*7? or if 


e Toe = 2*~3. Conversely, if 27-3 |cb(n) for all n then b(n) = 1 requires 
2 ?lesoc = 2-* or2-? since 1 < c «oS. 


o 
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10.13 Primitive Dirichlet characters mod p* 


In Theorem 8.14 we proved that every nonprincipal character y mod p 
is primitive if p is prime. Now we determine all the primitive Dirichlet 
characters mod p*. 

We recall (Section 8.7) that y is primitive mod k if, and only if, y has no 
induced modulus d < k. An induced modulus is a divisor d of k such that 


x(n) 21 whenever (n, k) = 1 and n = 1 (mod d). 


If k = p* and y is imprimitive mod p* then one of the divisors 1, p,..., p?~! 
is an induced modulus, and hence p*^! is an induced modulus. Therefore, 
X is primitive mod p^ if, and only if, p*^! is not an induced modulus for y. 


Theorem 10.14 For an odd prime p and & > 2, consider the q(p*) Dirichlet 
characters Xn mod p* given by (21). Then x, is primitive mod p* if, and 
only if, p y h. 

Proor. We will show that p^ ! is an induced modulus if, and only if, p|h. 


If p ¥ n we have, by (21), 


a 
x(n) = grihbin)otp » 


Where n = g*™ (mod p*) and g is à primitive root mod p^ for all f > 1. 
Therefore 
g"? =n (mod p*^!). 


Nowifn = 1 (mod p*-!) then g^? = 1 (mod p*^ !) and, since g is a primitive 
Toot of p*~1, we have o(p" !)|b(n), or 


b(n) = to(p^- !) = to(p)/p 


for some integer t. Therefore 
tan) = =, 
If p|h this equals 1 and hence y, is imprimitive mod p*. If p ¥ h take n = 
1 + p*^!, Then n = 1 (mod p*~!) but n # 1 (mod p°) so 0 < b(n) < o(p?). 
Therefore DA t, p ¥ ht and y,(n) # 1. This shows that y, is primitive if p ¥ h. 
O 


When m = 1 or 2, there is only one character y mod m, the principal 
character, If m = 4 there are two characters mod 4, the principal character 
and the primitive character f given by (24). The next theorem describes all 
the primitive characters mod 2* for æ > 3. The proof is similar to that of 

heorem 10.14 and is left to the reader. 


Theorem 10.15 If æ > 3, consider the q(2*) Dirichlet characters Xa, c Mod 2% 
given by (25). Then y,, is primitive mod 2° if, and only if, c is odd. 
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The foregoing results describe all primitive characters mod P^ for all 
prime powers. To determine the primitive characters for a composite modulus 
k we write 


ar 


kept Be 
Then every character x mod k can be factored in the form 


x—33 Xr 


where each y; is a character mod p;^. Moreover, by Exercise 8.12, y is primi- 
tive mod k if, and only if, each y; is primitive mod p;“. Therefore we have a 
complete description of all primitive characters mod k. 


Exercises for Chapter 10 


1. Prove that m is prime if and only if exp,(a) = m — 1 for some a. 
2. If (a, m) = (b, m) = 1 and if (exp, (a), exp,(b)) = 1, prove that 
exp,,(ab) = exp,(a)e"»,.(b). 


3. Let g be a primitive root of an odd prime p. Prove that —g is -lso a primitive root 
of p if p = 1 (mod 4), but that exp,(—g) = (p — 1)/2 if n = 3 (mod 4). 


4. (a) Prove that 3 is a primitive root mod p if p is a prime of the form 2^ + 1, n > 1. 


(b) Prove that 2 is a primitive root mcd p if p is a prime of the form 4q + 1, where q 
is an odd prime. 


. Let m > 2 bean integer having a primitive root, and let (a, m) = 1. We wri'e aRm if 
there exists an x such that a = x? (mod m). Prove that: 


(a) aRm if, and only if, a? = 1 (mod m). 
(b) If aRm the congruence x? = 


= a (mod m) has exactly two solutions. 


(c) There are exactly ¢(m)/2 integers a, incongruent mod m, such that (a, m) = 1 
and aRm. 


Assume m > 2, (a, m) = 1, aRm. Prove that the congruence x? = a (mod m) has 
exactly two solutions if, and only if, m has a primitive root. 


Let Sp) = Y 21 k", where p is an odd prime and n > 1. Prove that 


So) = Í 0 (og p) ifn # 0 (mod p — 1), 
—1 {mod p) ifn=0 (mod p — 1). 
Prove that the sum of the primitive roots mod p is congruent to j(p — 1) mod p. 
. If p is an odd prime > 3 prove that the product of the primitive roots mod p is 
congruent to ] mod p. 


10. Let p be an odd prime of the form 2”* + 1. Prove that imiti 
i $ the set 
mod p is equal to the set of quadratic n set of primitive roots 


i to onresidues mod p. Use thi 
that 7 is a primitive root of every such prime. i pi 
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11. Assume d|¢(m). If d = exp,,(a) we say that a is a primitive root of the congruence 
x! = 1 (mod m). 
Prove that if the congruence 
x?" = | (mod m) 
has a primitive root then it has y(g(m)) primitive roots, incongruent mod m. 
12. Prove the properties of indices described in Theorem 10.10. 
13. Let p be an odd prime. If (h, p) = 1 let 
S(h) = {h": 1 < n < gp - D, (mp — 1) = 1}. 


If h is a primitive root of p the numbers in the set S(h) are distinct mod p (they are, 
in fact, the primitive roots of p). Prove that there is an integer h, not a primitive root 
of p, such that the numbers in S(h) are distinct mod p if, and only if, p = 3 (mod 4). 

14. If m > 1 let p,, ..., p, be the distinct prime divisors of gm). If (g, m) = 1 prove 
that g isa primitive root of m if, and only if, g does not satisfy any of the congruences 
g*^ = | (mod m) for i = 1,2,...,k. 

15. The prime p — 71 has7asa primitive root. Find all primitive roots of 71 and also 
find a primitive root for p? and for 2p. 


16. Solve each of the following congruences: 
(a) 8x = 7 (mod 43). 


(b) xë = 17 (mod 43). 
(c) 8* = 3 (mod 43). 


17. Let q be an odd prime and suppose that p = 4q + 1 is also prime. 


(a) Prove that the congruence x? = —1 (mod p) has exactly two solutions, each 


of which is quadratic nonresidue of p. t Ek . 
(b) Prove that every quadratic nonresidue of p is a primitive root of p, with the 


exception of the two nonresidues in (a). 
(c) Find all the primitive roots of 29. 
18. (Extension of Exercise 17.) Let q be an odd prime and suppose that p = 2"q + 1 is 
Prime. Prove that every quadratic nonresidue a of p is a primitive root of p if 
a^" £1 (mod p). 
19. Prove that there are only two real primitive characters mod 8 and make a table 
Showing their values. 


20. Let x be a real primitive character mod m. If m is not a power of 2 prove that m has 
the form 


til 


m= 2°), tP, 
where the p, are distinct odd primes and a = 0, 2, or 3. Ifa = 0 show that 


4-9 [oe 


pim 


and find a corresponding formula for x(— 1) when a = 2. 
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Dirichlet Series and 
Euler Products 


11.1 Introduction 


In 1737 Euler proved Euclid’s theorem on the existence of infinitely many 
primes by showing that the series > p^, extended over all primes, diverges. 
He deduced this from the fact that the zeta function t(s), given by 


aa) w= Y 


n=1 


A 
T. 


for real s > 1, tends to oo as s — 1. In 1837 Dirichlet proved his celebrated 
theorem on primes in arithmetical progressions by studying the series 


@) Us, = Y 


where y is a Dirichlet character and s > 1. 
The series in (1) and (2) are examples of series of the form 


O 26 


a=1 " 


where f (n) is an arithmetical function. Th 
coefficients f(n). They constitute one of 
number theory. 

This chapter studies general properties of Dirichlet series, The next 


chapter makes a more detailed study of the Riemann zeta functi d 
the Dirichlet L-functions L(s, y). eds ep 


ese are called Dirichlet series with 
the most useful tools in analytic 


224 


11.2: The half-plane of absolute convergence of a Dirichlet series 


Notation Following Riemann, we let s be a complex variable and write 
s=0 +it, 


where c and t are real Then nē = e7 "8" = et logn — pei losn, This 
shows that |n*| = n° since |e| = 1 for real 0. 


The set of points s = ø + it such that o > a is called a half-plane. We will 
show that for each Dirichlet series there is a half-plane c > c, in which the 
series converges, and another half-plane c > c, in which it converges 
absolutely. We will also show that in the half-plane of convergence the series 
represents an analytic function of the complex variable s. 


11.2 The half-plane of absolute convergence 
of a Dirichlet series 


First we note that if c > a we have [| = n? > n? hence 


ft)| | feni. 


nm n 


Therefore, if a Dirichlet series x f (n)n^? converges absolutely for s = a + ib, 
then by the comparison test it also converges absolutely for all s with a > a. 
This observation implies the following theorem. 


Theorem 11.1 Suppose the series Y, | f (n)n ^| does not converge for all s or 
diverge for all s. Then there exists a real number c,, called the abscissa of 
absolute convergence, such that the series ); f (n)n ? converges absolutely 
if a > c, but does not converge absolutely if à < 64. 


PROOF. Let D be the set of all real ø such that » | f (n)n *| diverges. D is not 
empty because the series does not converge for all s, and D is bounded above 

cause the series does not diverge for all s. Therefore D has a least upper 
bound which we call c. Ifo < c, then « € D, otherwise ø would be an upper 
bound for D smaller than the least upper bound. If o > c, then c ¢ D since 
9, is an upper bound for D. This proves the theorem. BH 


Note. If ' | f(n)n^*| converges everywhere we define c, = — oo. If the 
Series 5" | f(n)n^*| converges nowhere we define c, = 4- oo. 


EXAMPLE 1 Riemann zeta function. The Dirichlet series ) 7. , n^* converges 
absolutely for c > 1. When s = 1 the series diverges, so c, = 1. The sum 
Of this series is denoted by /(s) and is called the Riemann zeta function. 


EXAMPLE 2 If f is bounded, say | f(n)| < M for all n > 1, then Y f(n)jn-* 


Converges absolutely for ø > 1, so c, < 1. In particular if y is a Dirichlet 
character the L-series L(s, x) = >. x(n)n * converges absolutely for o > 1. 
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EXAMPLE 3 The series }, n"n^ ? diverges for every s so 9, = +0. 
EXAMPLE 4 The series ) n^ "n^? converges absolutely for every sso c, = — co. 
11.3 The function defined by a Dirichlet 

series 


Assume that Y" f(n)n^* converges absolutely for o > c, and let F (s) denot 
the sum function 


e 
(4) F(s) = z £0) fora > o,. 


This section derives some properties of F(s). First we prove the following 
lemma. 


Lemma 1 If N > 1 and o > c > a, we have 


Y ft 
n-N $ 


ao 
€ N^679 Y | f(n)In-*. 
nzN 
Proor. We have 


È flan] < Y Ifon = Y foinne- 
n-N n-N n=N 


< N76-9 Y | f(n)In-*. o 
n=N 
The next theorem describes the behavior of F(s) as e + +00. 
Theorem 11.2 If F(s) is given by (4), then 
lim F(c + it) = f(1) 
0o 


uniformly for — oo < t < +o. 


Proor. Since F(s) = f+ Y? finn 
second term tends to Oas c > Is Na S fist te 


implies - Choosec > o,.Then for > cthelemma 
- v fin) eee 4 
Ju) Sero "i 
2 n* <2 Lleyn = s 


where A is independent of c and 


t. Since 4/27 
the theorem. 


> 0as c +00 this proves 


o 
EXAMPLES C(o + it) > 1 and L(c + it, y) 
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11.3: The function defined by a Dirichlet series 


We prove next that all the coefficients are uniquely determined by the 
sum function. 


Theorem 11.3 Uniqueness theorem. Given two Dirichlet series 


F(s) = b Im and G(s) = 5 NU 


n 


both absolutely convergent for € > 6a. If F(s) = G(s) for each s in an infinite 
sequence {s,} such that a, > +% as k — œ, then f(n) = g(n) for every n. 


Pnoor. Let h(n) = f(n) — g(n) and let H(s) = F(s) — G(s). Then H(s,) = 0 
for each k. To prove that h(n) = 0 for all n we assume that h(n) # 0 for some n 
and obtain a contradiction. 

Let N be the smallest integer for which h(n) 4 0. Then 


zo) MN), £ MD 


HOS an m NS EX ED 
Hence 
as MER 
h(N)- NH) -N Y —— 
n=N+1 7 


Putting s =: s, we have H(s,) = 0 hence 
o 
h(N) = —N* Y h(nn^* 
n=N+1 


Choose k so that ø, > c where c > o,. Then Lemma 1 implies 


E: / N WV 
ke -(ex- c) Do | 
|A(N)| € NN + 1) MALOL (x x i) A 


where A is independent of k. Letting k + oo we find (N/N + 1))** — 0 so 
h(N) = 0, a contradiction. o 


The uniqueness theorem implies the existence of a half-plane in which 
a Dirichlet series does not vanish (unless, of course, the series vanishes 
identically). 


Theorem 11.4 Let Fi (s) = 2: f(n)n~* and assume that F(s) # 0 for some s with 
o> a,. Then there is a half-plane € > c = 6, in which F(s) is never zero. 


PROOF. Assume no such half-plane exists. Then for every k = 1, 2,... there 
1S a point s, with c, > k such that F(s,) = 0. Since o, > +% as k > co the 
uniqueness theorem shows that (n) = 0 for all n, contradicting the hypothesis 
that F(s) ¥ 0 for some s. a 
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11.4 Multiplication of Dirichlet series 
The next theorem relates products of Dirichlet series with the Dirichlet 
convolution of their coefficients. 


Theorem 11.5 Given two functions F(s) and G(s) represented by Dirichlet series, 


F(s) = y fe) fore a, 
n=1 M 


and 


G(s) = » gin) for o >b. 


m 
Then in the half-plane where both series converge absolutely we have 

e h(n 
© Fijo) = y, "o 


n=1 


in 


where h = f * g, the Dirichlet convolution of f and g: 
Mn) = Y ras.) 
d|n d 


Conversely, if F(s)G(s) = Y o(n)n^* for all sina sequence (s,) with a, > -- co 
ask > oo thena = f *g. 


Proor. For any s for which both series converge absolutely we have 


FUG) = Y Son Y amm = reign) 
n- m= n=1m=1 


Because of absolute convergence we can multiply these series together and 
rearrange the terms in any wa 


Way we please without altering the sum. Collect 
together those terms for which mn is constant, say mn = k. The possible values 
of k are 1, 2,..., hence 


F(s)G(s) = Pal RI (eig) e= - Š aiok: 
=1\mn= k=1 


where hlk) = Xmn=x f(n)g(m) = (f * g)(k). This 


roves th i 
and the second follows from the : SESS assertion, 


uniqueness theorem. ü 


EXAMPLE 1 Both series Y n7* and È u(r)n7 conver 
A c ge absolutely fc 1. 
Taking f (n) = 1 and g(n) = p(n) in (5) we find h(n) = [1/n], e" E 


Us) Y ml. ifo 1. 
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In particular, this shows that ¢(s) # 0 for c > 1 and that 


— ifo>1. 
s 


EXAMPLE 2 More generally, assume f (1) # 0 and let g = f ^5, the Dirichlet 
inverse of f. Then in any half-plane where both series F(s) = ); f(n)n ^? and 
G(s) = Y. g(n)n™* converge absolutely we have F(s) # 0 and G(s) = 1/F(s). 


EXAMPLE 3 Assume F(s) = Y, f(n)n ? converges absolutely for o > o,. If 

f is completely multiplicative we have f 7!(n) = p(n) f (n). Since | f~*(n)| < 

" (n)| the series $^ u(n) f (n)n ^ * also converges absolutely for ¢ > c, and we 
ave 


pfe | 1 


p 5: TG ic 0 


In particular for every Dirichlet character y we have 


S pnyxn) arr 

t em ifo>l. 
am "m L{s, x) 
EXAMPLE 4 Take f(n) = 1 and g(n) = 9(n), Euler's totient. Since (n) < n 
the series Y^ (n)n~* converges absolutely for c > 2. Also, h(n) = Fan Eld) 
= n so (5) gives us 
Z=- 1) ifo>2 
n nin 


Therefore 


ExaMPLE 5 Take f(n) = 1 and g(n) = n. Then h(n) = 2, d* = o4(n), and 
(5) gives us 
(sys — a) = y zs ifo > max{1, 1 + Re(a)). 
n=1 


EXAMPLE 6 Take f(n) = 1 and g(n) = A(n), Liouville’s function. Then 


1 ifn =m? for some m, 
hln) = 10 S t otherwise, 


50 (5) gives us 


oo 1 LÀ 1 
woe ee 3 en Rae ™ 
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Hence 


==> doli 


11.5 Euler products 


The next theorem, discovered by Euler in 1737, is sometimes called the 
analytic version of the fundamental theorem of arithmetic. 


Theorem 11.6 Let f be a multiplicative arithmetical fu 
È f(n) is absolutely convergent. Then the sum oft 
as an absolutely convergent infinite product, 


nction such that the series 
he series can be expressed 


(6) È so -[Iü + s@ + fo) +--3 


extended over all primes. If 
simplifies and we have 


= 1 
(7) PEL) = Nip: 


Note. In each case the product is called the Euler product of the series. 


Proor. Consider the finite product 


Pix) = IIO + f0) + ft?) +--4 


f is completely multiplicative, the product 


extended over all 
of absolutel 
the terms i 
form 


primes p « x. Since this is the product of a finite number 
y convergent Series we can multiply the series and rearrange 
n any fashion without altering the sum. A typical term is of the 


SOSO) ++ fpr) = f(p,^p,* 
since f is multiplicative, B 
write 


as) 


y the fundamental theorem of arithmetic we can 


P(x) 2 Y f(n) 
ned 

where A consists of those n having all their prime factors « x. Therefore 

@ 

Y So) - P) = y fq), 

La neB 
where B is the set of n having at least one 
X fin) — Pee) 


As x — co the last sum on the right — 0 since i ce 
Poe) + Y. f(n) as x -» co. È 1f ()] is convergent. Hen 


prime factor > x. Therefore 


s Y Ift < Lis). 
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Now an infinite product of the form [[(1 + a,) converges absolutely 
whenever the corresponding series ) a, converges absolutely. In this case 
we have 


Tso) + S+ l EUSI +-- s 2, 1f. 
psx psx a= 
Since all the partial sums are bounded, the series of positive terms 
Yo) fo) 
P 
converges, and this implies absolute convergence of the product in (6). 
Finally, when f is completely multiplicative we have f(p") = f(p)" and 
each series on the right of (6) is a convergent geometric series with sum 
a — foy. o 


Applying Theorem 11.6 to absolutely convergent Dirichlet series we 
immediately obtain: 


Theorem 11.7 Assume Y, f(n)n * converges absolutely for o > oa. If f is 
multiplicative we have 
oo 2 
6) $f AA) usns 
n=1 " P p p 


and if f is completely multiplicative we have 


y I. 


1 
n H 1— f(pp* 
It should be noted that the general term of the product in (8) is the Bell 
Series f,(x) of the function f with x = p~*. (See Section 2.16.) 


ifo > Oa. 


n=1 


ExaMPLES Taking f(n) = 1, p(n), (n), oa(n), A(n) and x(n), respectively, we 
obtain the following Euler products: 


- 1 1 am 
t= YI ifo > 1. 
1 şe qp ifo>t. 
fe oe c RACER 
(s-) a eA mE tes 2, 
Te 7 gi aaa 
ü TERN v on) 1 if 1,1+R 
(sX(s — a) bs = I a—5758 775 ifo > max(1, 1 + Re(a)}, 


tQ)  & An) _ it aaa 
dip peer” 


e x(n) 1 : 
Lis.) = Le lie ifo>1. 


n=1 
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Note. If x = xı, the principal character mod k, then xi(p) = 0 if pik 
and y,(p) = 1 if p Y k, so the Euler product for L(s, x,) becomes 


1 1 =s -s 
Us x) 7 [zs TT - 9) = oa - 2. 
pik — p pl—p* op pik 
Thus the L-function L(s, 
finite number of factors. 


X1) is equal to the zeta function (s) multiplied by a 


11.6 The half-plane of convergence of a 
Dirichlet series 


To prove the existence of a half-plane of convergence we use the following 
lemma. 


Lemma 2 Let s, = Go + itg and assu 


me that the Dirichlet series Y. f(n) 
has bounded partial sums, say 


È fini» 


nsx 


Jor all x > 1. Then for each s with o > 09 we have 


(9) < xd iter oa. 
0 — 0, 


<M 


2. An 


a<ns 


PROOF. Let a(n) = 


f(n)n^** and let A(x) = 
a(n)n°~5 


sxG(n) Then f(n)n75 = 
So we can apply Theorem 4,2 (with f(x) 


= x5) to obtain 
b 
2; SOn = Abb — Alaja +. (5 — 54) [40077 a. 
a<ns a 
Since | 4(x)| < M this gives us 


È f(nn- 


a<nsb 


S Mb^^7* + Mavo-2 


b 
+|s— sIm [ BORSA dp 
a 


b%-¢ — q9o-9 
—— s 


09 — 0 


< 2Ma~? 4 Is — So|M 


" Me QI sl) 7 


9 — do 
EXAMPLES If the partial sums 


X. f(n)n^* converges for c 
obtain, for o > 0, 


Moss f(n) are bounded, 


Lemma 2 implies that 
> 0. In fact, if we take 


So = Go = 0 in (9) we 


X finn 


< Kar" 
a<nsb 
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hs where K is independent of a. Letting a— +o we find that Y f(n)n^* 
converges if ¢ > 0. In particular, this shows that the Dirichlet series 


e (-nD 
p n 

converges for e > 0 since |$,., (—1)| < 1. Similarly, if y is any non- 
principal Dirichlet character mod k we have |Ð n <x x(n)| < e(k) so 


Y x(n) 


" 


n-1 


converges for o > 0. The same type of reasoning gives the following theorem. 


1 Theorem 11.8 If the series Y, f(n)n ? converges for s = Go + ito then it also 
converges for all s with o > ag. If it diverges for s — Gg + itg then it 

| diverges for all s with € < Go. 

| Pnoor. The second statement follows from the first. To prove the first 
statement, choose any s with o > co. Lemma 2 shows that 


Y SOn Eek 
a<nsb 
where K is independent of a. Since a^^ > 0 as a > +0, the Cauchy con- 
dition shows that È, f(n)n ^ converges. [mi 


Theorem 11.9 If the series $, f(n)n ^? does not converge everywhere or diverge 
everywhere, then-there exists a real number c., called the abscissa of 
convergence, such that the series converges for all s in the half-plane o > c, 
and diverges for all s in the half-plane o < 64- 


Pnoor. We argue as in the proof of Theorem 11.1, taking c, to be the least 
upper bound of all ø for which Y, f (nmn? diverges. oO 


Note. If the series converges everywhere we define c, = — oo, and if it 
converges nowhere we define c, = +0. 


Since absolute converge implies convergence, we always have Ga 2 Os 
If c, > c, there is an infinite strip c, < & < c, in which the series converges 
Conditionally (see Figure 11.1.) The next theorem shows that the width of 


| this strip does not exceed 1. 


Divergence Convergence 
Conditional Absolute convergence 
convergence 


Figure 11.1 
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Theorem 11.10 For any Dirichlet series with c, finite we have 
0<o,-—0,<1. 


Proor. It suffices to show that if y» f (n)n^?? converges for some s, then it 
converges absolutely for all s with & > o + 1. Let A be an upper bound for 
the numbers | f (n)n??|. Then 


f| |fe|| 1 4 
m m? m^3|^7 p~eo 
so Y |-f(n)n^?| converges by comparison with ); n7°~°, oO 
EXAMPLE The series 
2 (—1y 
p n 


converges if > 0, but the convergence is absolute only if ¢ > 1. Therefore 
in this example c, — 0 and c, — 1. 


Convergence properties of Dirichlet series can be compared with those 
of power series. Every power series has a disk of convergence, whereas 
every Dirichlet series has a half-plane of convergence. For power series 
the interior of the disk of convergence is also the domain of absolute con- 
vergence. For Dirichlet series the domain of absolute convergence may 
be a proper subset of the domain of convergence. A power series represents 
an analytic function inside its disk of convergence. We show next that a 


Dirichlet series represents an analytic function inside its half-plane of 
convergence, 


11.7 Analytic properties of Dirichlet series 


Analytic properties of Dirichlet series will be deduced from the following 
general theorem of complex function theory which we state as a lemma. 


Lemma 3 Let {f,} be a sequence of functions analytic on an open subset S 
of the complex plane, and assume that { f, 


ass } converges uniformly on every 
compact subset of S to a limit function f. Then f is analytic on S and the 
sequence of derivatives { f7} converges uniformly on every compact subset of 
S to the derivative f". 


Pnoor. Since f, is analytic on S we have Cauchy's integral formula 


fla) = f 4 


"i Japz — a 


dz 
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where D is any compact disk in S, @D is its positively oriented boundary, 
and a is any interior point of D. Because of uniform convergence we can pass 
to the limit under the integral sign and obtain 


2ni Jap Z — a 


which implies that fis analytic inside D. For the derivatives we have 


sta) = 5 [ Md ars manae a mE ar. 


2xi Jap (z — a)? 22i Jap (z — a)? 


from which it follows easily that f(a) > f'(a) uniformly on every compact 
subset of S as n — oo. 


To apply the lemma to Dirichlet series we show first that we have uniform 
convergence on compact subsets of the half-plane of convergence. 


Theorem 11.11 A Dirichlet series ), f(n)n ^ converges uniformly on every 
compact subset lying interior to the half-plane of convergence c > ce. 


Pnoor. It suffices to show that Y; f(n)n^? converges uniformly on every 
compact rectangle R = [«, f] x [c,d] with « > o.. To do this we use the 
estimate obtained in Lemma 2, 


/ 


* “ane Is — sol 


where so = Go + itọ is any point in the half-plane & > c. and s is any point 
With c > a9. We choose so = co Where g, < eo < &. (See Figure 11.2.) 


ee iod 


4 h 
a 


So 7 90 


dii-- 


+ 
[ 
Figure 11.2 


Then if se R we have c — Oo 20 — 0o and |so — s| < C, where C is a 
Constant depending on s, and R but not on s. Then (10) implies 


Y fens 


a<nsb 


C 
o | = B jg9o-24 
< 2Ma^* ( zr a =) = ba 


Where B is independent of s. Since a^? * — 0 as a > +00 the Cauchy condi- 
tion for uniform convergence is satisfied. [m] 
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Theorem 11.12 The sum function F(s) = 5 fann ofa Dirichlet series is 
analytic in its half-plane of convergence c > c., and its derivative F'(s) 
is represented in this half-plane by the Dirichlet series 


2 f(n) 
(11) F(s)= — 2 emi 


obtained by differentiating term by term. 


Proor. We apply Theorem 11.11 and Lemma 3 to the sequence of partial 
sums. 0 


Notes. The derived series in (11) has the same abscissa of convergence and 
the same abscissa of absolute convergence as the series for F (s). 


Applying Theorem 11.12 repeatedly we find that the kth derivative is 
given by 


c. 


» k 
Fs) = (—1) Dy Poen for o > c, 
n=1 


EXAMPLES For c > 1 we have 


(12) (e - y Bi" 
n=1 Ħ 
and 
US) AG) 
(13) sO 
Us) », n 


Equation (12) follows by differentiating the series for the zeta function term 
by term, and (13) is obtained by multiplying the two Dirichlet series X A(n)n ^? 
and n^* and using the identity 54, A(d) = log n. 


11.8 Dirichlet series with nonnegative 
coefficients 


Some functions which are defined b 
convergence c > c, can be continu 
For example, in the next chapter we 
t(s) can be continued analytically b 
is analytic for all s except for a sim 
principal Dirichlet character, the 

alytically beyond the line ¢ = Lit 
The singularity for the zeta functio 
Landau which deals with Dirichle 
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y Dirichlet series in their half-plane of 
ed analytically beyond the line o = o,. 
will show that the Riemann zeta function 
eyond the line ¢ = 1 to a function which 
ple pole at s = 1. Similarly, if y is a non- 
L-function L(s, y) can be continued an- 
o an entire function (analytic for all s). 
n Is explained by the following theorem of 
t series having nonnegative coefficients. 


e — 


11.8: Dirichlet series with nonnegative coefficients 


Theorem 11.13 Let F(s) be represented in the half-plane o > c by the Dirichlet 
Series 


(14) wns 
n=1 


where c is finite, and assume that f (n) > 0 for all n > no. If F(s) is analytic 
in some disk about the point s — c, then the Dirichlet series converges in the 
half-plane o > c — e for some £ > 0. Consequently, if the Dirichlet series 
has a finite abscissa of convergence o,, then F(s) has a singularity on the real 
axis at the point s = o,. 
PROOF. Let a = 1 + c. Since F is analytic at a it can be represented by an 
absolutely convergent power series expansion about a, 
z F*Xa 
(15) r= 5 o 
and the radius of convergence of this power series exceeds 1 since F is 
analytic at c. (See Figure 11.3.) By Theorem 11.12 the derivatives F(a) can 
be determined by repeated differentiation of (14). This gives us 


Fa) = (1) Y. fln)(log n'n7*, 
n=1 


f (n) 


n 


(s — a), 


So (15) can be rewritten as 


c œ 


(16) F=} > 


k=0 n=1 
Since the radius of convergence exceeds 1, this formula is valid for some real 
$— c — & where ¢ > 0 (see Figure 11.3.) Then a — s = 1 +e for this s 
and the double series in (16) has nonnegative terms for n > no. Therefore 
We can interchange the order of summation to obtain 


CZE finog nj". 


zi e ER VAL S f(n) 
Fe— ja 5 LM F (A+ Plog nt S LO) rte ign m 
d= m ke. k! er Lor 
In other words, the Dirichlet series Y. f(n)n^* converges for s = c — c, hence 
It also converges in the half-plane & > c — £. Oo 
Figure 11.3 
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11.9 Dirichlet series expressed as exponentials 
of Dirichlet series 
A Dirichlet series F(s) = $^ f(n)n^? which does not vanish identically has a 


half-plane in which it never vanishes. The next theorem shows that in this 
half-plane F(s) is the exponential of another Dirichlet series if f(1) 4 0. 


Theorem 11.14 Let F(s) = y f(n)n^? be absolutely convergent for o > c, 
and assume that f(1) # 0. If F(s) # 0 for o > oo > o,, then for o > cg 
we have 


F(s) = e& 
with 
SES ef y)m., 
G(s) = log f(1) + x Pau hails 
where f ^! is the Dirichlet inverse of f and f'(n) = f(n)log n. 


Note. For complex z # 0, log z denotes that branch of the logarithm which 
is real when z > 0. 


PROOF. Since F(s) # 0 we can write F(s) = e% for some function G(s) which 
is analytic for ¢ > oy. Differentiation gives us 

F'(s) = e99G'(s) = F(s)G'(s), 
So G'(s) = F'(s/F(s). But 


Ss 5 


‘he v f(nlogn w f'n) 1 w fin) 
F(s) = — T 2 
t) 2 n li n and Bis qm 
hence 


j= F.L- Sn 
G'(s) = F'(s) O li T , 
Integration gives 


S fn) 
G(s) =C ——M———n^ 
Dod iue T 

Where C is a constant. Lettin 


B 0 +00 we find lim, ,, G(o + it) = C, 
hence 


f(1) = lim F(s + it) = e€ 


g-o 


so [^ = log f(1). This completes the proof. The proof also shows that the 
series for G(s) converges absolutely if a > Go. g 
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EXAMPLE 1 When f(x) = 1 we have f'(n) = log n and f ^ !(n) = p(n) so 


n 


(f' * f!) = log a(z) = A(n). 
din 


Therefore if c > 1 we have 


(17) Us) = e& 
where 
a S AQ. 
G(s) = LE [m n 5 


EXAMPLE 2 A similar argument shows that if f is completely multiplicative 
and F(s) = Y, f(n)n^ then in the half-plane of absolute convergence o > o, 
we have 


F(s) = e& 
where 
— & SMA) -. 
Gs) = 2 logn n 


since (f’ « f -!)n) = Yay f(d)log dy(n/d) f (n/d) = f (AQ). 


The formulas in the foregoing examples can also be deduced with the 
help of Euler products. For example, for the Riemann zeta function we have 


p 


Keep s real, s > 1, so that ((s) is positive. Taking logarithms and using the 
Power series —log(1 — x) = }, x"/m we find 


1 
t(s) = li 
P 


© -ms o 
where 
A(n) = > if n = p" for some prime p, 
0 otherwise. 
But if n — p" then logn = mlog p = mA(n) so 1/m = A(n)/log n. Therefore 
MEER 
og C(s) = 2. logn n 


which implies (17) for real s > 1. But each member of (17) is analytic in the 
half-plane o > 1 so, by analytic continuation, (17) also holds for o > 1. 
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11.10 Mean value formulas for Dirichlet 
series 


Theorem 11.15 Given two Dirichlet series F(s) = >D f(n)n^5 and Gs) = 
Y. g(n)n^? with abscissae of absolute convergence c, and 05, respectively. 
Then for a > c, and b > 6; we have 


lim > jn Fla + ite — it) dt = y LO). 
ST n=1 


EET; "É 
PRoor. We have 

mën 
Now 


so the series is absolutely convergent, and this convergence is also uniform 
for all t. Hence we can integrate term by term and divide by 2T to obtain 


pur? y h 
2T f mu + it)G(b — it) dt 


-EL p Loo 1 


nos maci mn 2T Jj 
mtn 


a log 7 
Bn 


e" log(n/m) dt. 
But for m # n we have 


eit log(n/m) 


{i et log(n/m) dt = 
-T i log(n/m) 


so we obtain 


1 T 
2T f MRa + it)G(b — it) dt 


" n 
EXC NEEDLE (| 
TN oe, mn Y 


m*n T tes) 
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11.10: Mean value formulas for Dirichlet series 


Again, the double series converges uniformly with respect to T since (sin x)/x 
is bounded for every x. Hence, we can pass to the limit term by term to 
obtain the statement of the theorem. oO 


Theorem 11.16 If F(s) =), f(n)n^? converges absolutely for c >o, 
then for o > o, we have 


= ke fe oerte 
(18) lim zz] Fo + ih dt = ym 
In particular, if o > 1 we have 
(a) tim 4L. [Ite + iP de = Y A, = C20) 
rx Hr] IU OE e 
(b) tim JL. J Ne + in? dt = y Peg". tono, 
T+0 2T J-r nat n 
METTUS 
(© lim 5 fete + i die ye aa 


200p (T — x eo'(n  L*Qo) 
O tim zp eripias È E = ao 


PRoor. Formula (18) follows by taking g(n) = J(n) in Theorem 11.15. To 
deduce the special formulas (a) through (d) we need only evaluate the Dirichlet 
Series )' | f(n)|?n^ ?* for the following choices of f (n): (a) f(n) = 1;(b) f(n) = 
(—1)* log* n; (c) f(n) = u(n); (d) f(n) = co(n). The formula (a) is clear, and 
formula (b) follows from the relation 

2 log*n 


(9 = (7D Y 75 


To prove (c) and (d) we use Euler products. For (c) we have 


e y(n) ~The Me) 
2 " aiiud Te lx ym (Qs) 
Replacing s by 2a we get (c). For (d) we write 
œ 2 
oo = [I + esp? + cop?) +-+} 
n=1 P = 


=[] a 423^ + 3p +--+} 

o Lal] maera fü) 
- I [Ee eve] = acp 7 o 
xl xs 


— M =- Now replace s by 2c to get (d). 
(x — para —x* 


m] 


SAL 


Since yo (n + 1P = 
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11.11 An integral formula for the coefficients 
of a Dirichlet series 


Theorem 11.17 Assume the series F(s) = Y z-1f(n)n * converges absolutely 
for o > 6a. Then for o > c, and x > 0 we have 


T ; fin) ifx=n, 
S ctit — 
m S x] 2 His pus 4 otherwise. 
Proor. For o > c, we have 


(19) xl. F(o + it)x?** dt = boi =f we f(r) (=) at 


C 
Ta=1 " 


xt LJ fin) T 
m 2 n^ 


elt login) dt, 
-T 


since the series is uniformly convergent for all t in any interval [— T; T]. If x 
is not an integer then x/n + 1 for all n and we have 


- 2 saf 7 ESI 
f eit login) dt — n 
ar | ) 


and the series becomes 


which tends to 0 as T > co. However, if x is an integer, say x = k, then the 
term in (19) with n = k contributes 


T (xt T /,\it T 
=i) at = = = Er 
LG) Le M [2 dins 
and hence 


Ei pte AG = ya KL fe» (7 (Ey 
ml at = fe Y fo 6 dt 
n*k 


The second term tends to 0 as T > 
00 as w: ; 
argument. as shown in first part of the 
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11.12: An integral formula for the partial sums of a Dirichlet series 


11.12 An integral formula for the partial 
sums of a Dirichlet series 


In this section we derive a formula of Perron for expressing the partial sums 

of a Dirichlet series as an integral of the sum function. We shall require a 

lemma on contour integrals. 

Lemma 4 Ifc > 0, define (:* 2! to mean limp... [67 1T. Then if a is any positive 
real number, we have 


l ifaz1, 
1 c+oi dz D 
xi] iem 2 dpi 
0 ifücac«l 


Moreover, we have 


ctiT c 
(20) xil pe. Ape 
Aem E zT lo h 
8a 
L quouis gm at x 
21 = -1, 
en 2ni La 2 IE m fa 
and 
p '(erisaz *] ie 
22 = —=—=|<— EL 
dt xl. z is ifa 


PRoor, Suppose first that 0 < a < 1 and consider the rectangular contour _ 
R shown in Figure 114. Since a*/z is analytic inside R we have |p a*/z dz = 0. 


Hence 
c+ iT c+ iT b+ iT b-iT 
f = Í + + f E 
md ^ a b+iT b-iT ei 


c*iT b*iT 


i 


c—iT b—iT 
Figure 11.4 
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so 


(ah b b ax 
+iT € xe [mas 
Şa b ZI 


[^ 
2 E 2T 2(-4Y, 2Ta'. 
ione pr TN b 


Let b > oo. Then a’ — 0, hence 


+ iT dz 2a* 
| f PAS — Tn 
c-iT Z = 
T tod) 
This proves (20). 
-b+iT c*iT 
-b 7G c 
-b-iT c-iT 


Figure 11.5 


Ifa > 1 weuseinstead the contour R shown in Figure 11.5. Hereb > c > 0 
and T > c. Now a^/z has a first order pole at z = 0 with residue 1 since 


d —e&'*5-—1.-zloga + O(|z|?) asz 0. 
Therefore 


ctiT -b*iT -b-iT 
2ni = (J $ Í +f «pn A —, 
- c-iT ctiT b+iT =if, 


hence 


1 n. ds 1 c+iT -b+iT TR y 
uS @—-1=— I + f + i *—. 
Rri Je-ir z zal ~b+iT -b-iT c-iT oF 


We now estimate the integrals on the right. We have 
ctiT 
d c x c C 
Í "re ras trae 
ir Z zb T Tj. Tloga 
i b+iT dz 
b- e z 
-b-iT c x É 
Í enja f mmo 
c-iT z ep od Tloga 
As b > oo the second integral tends to 0 and we obtain (21). 
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a" 
, 


<2T 
b 


11.12: An integral formula for the partial sums of a Dirichlet series 


When a = 1, we can treat the integral directly. We have 


c+iT T x T T 
dz d 

ji I t » = = zdy + ic [ " I 

c-ir Z -re+ly -rC€ Ry -TC ty 


f 
= 2ic | E. 21 
oC ty 


the other integral vanishing because the integrand is an odd function. Hence 


] pev mam wg 1 T d.d c 
y 

- =- = ; ^ arctan — = = — —arctan —. 

2ni Jer Z mJoc^-cy n e 2 we T 


Sincearctan c/T < c/T this proves (22), and the proof of Lemma 4 is complete. 
oO 


Theorem 11.18 Perron’s formula. Let F(s) = $, f(n)/n* be absolutely 
convergent for ¢ > 6,; let c > 0, x > Obe arbitrary. Then if a > o, — c we 
have: 

1 c oi x fin) 

a ; S odreytiMe 

xj." 4 z) b z P = 

where 3 * means that the last term in the sum must be multiplied by 1/2 when 

x is an integer. 


Proor. In the integral, c is the real part of z, so the series for F(s + z) is 
absolutely and uniformly convergent on compact subsets of the half-plane 
o +c > o,. Therefore 


ctiT z c+iT œ z 
| e+e f X IT a 
T 


c-i c-iT n=1 Z 


e] fin) +iT x [dz 
Ex G) z 


Ein 


D fin) NL Y fi) NOE 


ll 


s 
nex W Je-ir\ n) Z 0 ax A Je-ir\N} Z 
+iT 
f) dz 
rm > 
X Jum 


the symbol +’ indicating that the last term appears only if x is an integer. 
In the finite sum > ,<x we can pass to the limit T > oo term by term, and the 
integral is 2zi by Lemma 4. (Here a = x/n,a > 1.) The last term (if it appears) 
Yields nif (x)x-* and the theorem will be proved if we show that 


Q3) nay fe "s aÔ 


s 
T>on>x " J-irM/ Z 
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We know that quim (x/ny(dz/z) = 0 if n> x but to prove (23) we must 


estimate the rate at which {¢*j7 tends to zero. 
From Lemma 4 we have the estimate 


iT 
ie EP ag if0<a<l. 
eit Z aC Y 
og 


a 


Here a — x/n, with n> x. In fact, nz [x] + 1, so that l/a = n/x 2 
([x] + 1)/x. Hence 


sn (1 (x Y dz 
2 n ES z 


1f( 2 (xY 1 
$3. n° :G) E m 3) 
log| mu 


2 x Ifin) 
T SER 55 ET >0 asT- o. 
PAT ae 


This proves Perron’s formula. [m] 
Note. If c > a, Perron’s formula is valid for s = 0 and we obtain the 
following integral representation for the partial sums of the coefficients: 
1 c coi 
= F 
2ni Je- i 6 


x 


= de = flo 


Exercises for Chapter 11 


1. Derive the following identities, valid for o > 1. 


@ wss f 5 dx. 
(b) Ys =s ? m dx, where the sum is extended over all primes. 
© 5 =s i A dx, where Mo) = Y pln), 
@ - m =s m ud dx, where Vix) = X An). 
(e) Ls, x) = s f n dx, where A(x) — 20 
Show tei E is also valid for « > 0 if y is a nonprincipal character. [Hint: 
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Exercises for Chapter 11 


2. Assume that the series YS f (n) converges with sum A, and let A(x) = Ya<x f(n). 


(a) Prove that the Dirichlet series F(s) = Yei f(n)n™* converges for each s with 
o > Oand that 
® R(x) 


" xe dh 


e f 
Lag c4 


where R(x) = A — A(x). [Hint: Theorem 42.] 
(b) Deduce that F(c) + A as c > 0+. 
' (c) Ifo > Oand N > Lisan integer, prove that 
S fin) AQ) œ A0) ay 


FQ) = aN +s Lyn 


(d) Write s = ø + it, take N = 1 + [1t] in part (c) and show that 
|F(o + it)| =O") if0<0< I. 


3. (a) Prove that the series )^ n^ ' -it has bounded partial sums if t # 0. When t = Othe 


partial sums are unbounded. 
(b) Prove that the series Yu -it diverges for all real t. In other words, the Dirichlet 


series for C(s) diverges everywhere on the line o = 1. 
4. Let F(s) = 21 f(n)n^* where f(n) is completely multiplicative and the series 
converges absolutely for > 9,- Prove that if ¢ > c, we have 
F() — & SAM) 
F) oat 7 — 


In the following exercises, A(n) is Liouville's function, d(n) is the number of 
divisors of n, v(n) and x(n) are defined as follows: v1) = 0, «(1) = 1; if 
n = p,™ --- p,** then v(n) = k and x(n) = a142 *: * ak- 

Prove that the identities in Exercises 5 through 10 are valid for o > 1. 


s y 42. CO ave 2a) OS 
ani W^ oS P DEG) 
aria) m ou 2 x(n) (SXQSXGS 

6. PA I NIS. pocap x 
POTA * Xo. 06 
229 (QU) e xe) C 

"en tO 10. p n Gs) 


11. Exptess the sum of the series Yer 3%c(n)A(n)n~* in terms of the Riemann zeta 


function. 

12. Let fbe a completely multiplicative function such that f (p) = f (p) for each prime p. 
If the series J, f (n)n ? converges absolutely for > c, and has sum F(s), prove that 
F(s) # 0 and that 


y ftm) _ FQ9) ifo > 6,. 
Ld F(s) 
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iplicati i = f(pY for each prime p. If the series 
13. Let f bea multiplicative function such that f (p) = f(p) 
b3 pia f (n)n~* converges absolutely for ¢ > c, and has sum F(s), prove that F(s) + 0 
and that 


Slan) — Fs) ifo>o,. 
; m F(s) 


mz 


14. Let f be a multiplicative function such that SM f(n)n^? converges absolutely for 
c > c,. If pis prime and c c, prove that 


MOOM 


n 


(1 — f(p)p~*) 


z e Sfiunulp, n) 
(+ fip), L^ 


where jp, n) is the Möbius function evaluated at the ged of p and n. 
[Hint: Euler products.] 


15. Prove that 


in terms of the Riemann zeta function. 


16. Integrals of the form 


(24) Jis = ds A dx, 
1 


where A(x) is Riemann-integrable on eve 
properties analogous to those of Dirichlet Series. For example, they possess a 
half-plane of absolute convergence o > [^ 


and a half-plane of convergence c > o, 
in which f(s) is analytic. This exercise describes an analogue of Theorem 11.13 
(Landaw's theorem). 


Let f(s) be represented in the half-plane o > 9, by (24), where c, is finite, and 


assume that A(x) is real-valued and does not change sign for x > Xo. Prove that 
S(s) has a singularity on the real axis at the point s = g,. 


17. Let À(n) = Dan d'Ald) where A(n) is Liouville’s function. Prove that if o> 
max{1, Re(a) + 1}, we have 


Ty compact interval [1, a], have some 


F Adn) _ UKs — 2a) 
aci n Us — a) 


and 


y mum) tst — a) 
zi NH Q) C 
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The Functions 
C(s) and L(s, x) 


12.1 Introduction 
This chapter develops further properties of the Riemann zeta function (s) 
and the Dirichlet L-functions L(s, y) defined for ø > 1 by the series 


S x(n) 
n 


(9- XL adis X 
n=1 


As in the last chapter we write s = c + it. The treatment of both ¢(s) and 
L(s, x) can be unified by introducing the Hurwitz zeta function ((s, a), defined 
for c > 1 by the series 
eo 1 
(sd Ya 

Here a is a fixed real number, 0 < a € 1. When a = 1 this reduces to the 
Riemann zeta function, (s) = ¢(s, 1). We can also express L(s, y) in terms 
of Hurwitz zeta functions. If y is a character mod k we rearrange the terms 
in the series for L(s, y) according to the residue classes mod k. That is, 
We write 


n=qk +r, where 1 < r < k and q = 0, 1,2,..., 


k 


and obtain 
2 k 2 xgk4r) l« e 1 
Lie via x(n) _ xq t= xe) Y : 
(s, x) A P p (qkt+ry k p q-0 (a ES r) 


-k r 
=k"); wfs i} 
r=1 
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12: The functions C(s) and L(s, x) 


This representation of L(s, x) as a linear combination of Hurwitz zeta func- 
tions shows that the properties of L-functions depend ultimately on those of 
t(s, a). 

Our first goal is to obtain the analytic continuation of ¢(s, a) beyond the 
line ø = 1. This is done through an integral representation for ¢(s, a) obtained 
from the integral formula for the gamma function I (s). 


12.2 Properties of the gamma function 


Throughout the chapter we shall require some basic properties of the gamma 
function T(s). They are listed here for easy reference, although not all of them 


will be needed. Proofs can be found in most textbooks on complex function 
theory. 


For c > 0 we have the integral representation 
(1) T(s) = f xir leo židy. 
o 


The function so defined for ø > 0 can be continued beyond the line ø = 0, 
and I'(s) exists as a function which is analytic everywhere in the s-plane 
except for simple poles at the points 


s=0, —1, —2, -3,..., 


with residue (— 1)'/n! at s = —n. We also have the representation 


Sn! 
I()- lim — 27" — Ed 
(s) PAGE) fors #0, —1, —2,..., 
and the product formula 
1 cs T] SA 
im Dr «3y ^^ for alls, 


where C is Euler's constant. Since the product converges for all s, T(s) is 
H H d 
never zero. The gamma function satisfies two functional equations, 
: 


(2) I(s + 1) = sI' (s) 

and 

6) T(r — s =—_, 
Sin 7s 


valid for all s, and a multiplication formula 


(4) ror(: + 1) ee r(s + -— ? = Qn)" 2 mD mpg) 


valid for all s and all integers m > 1. 
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12.3: Integral representation for the Hurwitz zeta function 


We will use the integral representation (1), the functional equations (2) 
and (3), and the fact that Is) exists in the whole plane, with simple poles at 
the integers s = 0, —1, —2,... We also note that T(n + 1) =n! ifn isa 
nonnegative integer. 


12.3 Integral representation for the Hurwitz 
zeta function 


The Hurwitz zeta function ((s, a) is initially defined for ø > 1 by the series 
C 1 
(s, a) = EX (nay 


Theorem 12.1 The series for ((s,a) converges absolutely for o > 1. The 
convergence is uniform in every half-plane ø => 1 + 0, ô > 0, so t(s, a) is 
an analytic function of s in the half-plane o > jl 


Pnoor. All these statements follow from the inequalities 


Y In +a) = Yat+ars Y (n ayer. o 
aci ’ n=1 n=1 
Theorem 12.2 For o > 1 we have the integral representation 
oo x$319- 45 
(5) T(sXs, a) = Í i — dx. 
ELT 


In particular, when a — 1 we have 


c x57! mx 
nae = [Ta 


Proor. First we keep s real, s > 1, and then extend the result to complex s 
by analytic continuation. 


In the integral for T(s) we make the change of variable x = (n + a)t, 
where n > 0, to obtain 
Ts) = Í ext! dx = (n+ ar [ E 
o 0 
or 


(n + a) Ths) = Í ez "te "in Vat: 
o 


Summing over all n > 0 we find 
taros È [eerta 
n-0 40 
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the series on the right being convergent if ¢ > 1. Now we wish to interchange 
the sum and integral. The simplest way to justify this is to regard the integral 
as a Lebesgue integral. Since the integrand is nonnegative, Levi's convergence 
theorem (Theorem 10.25 in Reference [2]) tells us that the series 


G 
» eg Me atps—1 


converges almost everywhere to a sum function which is Lebesgue-integrable 
on [0, +00) and that 


Os, a(S) = Y | e""e-"P-'dt— | Y eea- dr, 
n-0 40 0 n-0 
But if t > 0 we have 0 < e^' < 1 and hence 
S 1 
—nt = 
ze l-e” 


the series being a geometric series. Therefore we have 


Lj —at,s-l 
Me. uet pP 

X e "e ates Li 1 EL 

n-0 —E 


almost everywhere on [0, +0), 


in fact everywhere except at 0, so 


© oc © 5—at,s-1 
s,s) = | F errena- ge = i P d 

0 n=0 o l—-e 
This proves (5) for real s > 1. T. 
that both members are analyti 


20 e [571 © gatjo—1 1 Lj e 9-1 
= aS Í -— dt- Í f —À—À 
Í Lese os distet (f «f [05 


If 0x t € 1 we have (*7! « i5, 


and if t> 1 we have 177! < 71 Also, 
since e' — 1 > t fort > 0 we have 


1 5-at,0—1 1 5(1—a)t,ó 1 1-a 
e p e t x = e 
Tref ; : dt < e «|? ldt= S 

o e o E= o ô 


and 


9 g-ata-1 © g-atc-1 © g-attc-1 
Í E as f 1 zdz [ Toon tt = Tek, a). 


—e " 


This shows that the integral in (5) converges uniformly in every strip 1 + ô 
S0<c, where ó 0, and therefore Tepresents an analytic function in 
every such strip, hence also in the half-plane o > 1. Therefore, by analytic 
continuation, (5) holds for all s With o> 1. 0 
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12.4 A contour integral representation for the 
Hurwitz zeta function 


To extend ¢(s, a) beyond the line ø = 1 we derive another representation in 
terms of a contour integral. The contour C is a loop around the negative 
real axis, as shown in Figure 12.1. The loop is composed of three parts 
C,, C2, C3. C; isa positively oriented circle of radius c < 27 about the origin, 
and C,, C, are the lower and upper edges of a “cut” in the z-plane along the 
negative real axis, traversed as shown in Figure 12.1. 


-e0 


a C3 


ee ) 


Figure 12.1 


This means that we use the parametrizations z = re^" on C, and z = re^ 
on C, where r varies from c to +00. 


Theorem 12.3 If 0 < a < 1 the function defined by the contour integral 


1 zv 1 gaz 
pe d 
Is, a) xl. pes 


is an entire function of s. Moreover, we have 
(6) t(s, a) = T(1 — s)I(s,a) ifo > 1: 


Pnoor. Here z* means re^ "* on C, and r'e"* on C3. We consider an arbitrary 
compact disk |s| « M and prove that the integrals along C, and C; converge 
uniformly on every such disk. Since the integrand is an entire function of s 
this will prove that J(s, a) is entire. 

Along C, we have, for r > 1, 


jz" | = aie me || = r7~ Lert x rM 31gnM 
Since |s| « M. Similarly, along C3 we have, for r > 1, 
j=] = pai [enit 1410) E pole" < rM AgnM. 


Hence on either C, or C, we have, for r > 1, 


sé 1e% rM- lee 1 r^^ IgxMoU -ar 


1—e 
Bute -1> e'/2 when r > log 2 so the integrand is bounded by Ar™~! e~ar 
where A is a constant depending on M but not on r. Since | rM !e-er dy 
converges if c > 0 this shows that the integrals along C, and C, converge 


z 


1 — e” e—1 
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uniformly on every compact disk |s| < M, and hence I(s, a) is an entire 
function of s. à 
To prove (6) we write 


2nil(s, a) — (f. + Ie + fae 19(z) dz 


where g(z) = e'*/(1 — e). On C, and C; we have g(z) = g(—r), and on C; 
we write z = ce, where —z < 0 < n. This gives us 


2nil(s, a) = fa le7nisq( —r) dr + i f c3 tes- Pc gig (cei) d0 


n 


+ j p teg(—r) dr 


= 2i sin(zs) | r!g(—r) dr + ic Í e™®g(ce"®) q0. 
Dividing by 2i, we get 
zl(s, a) = sin(zs)I (s, c) + I($, c) 
say. Now let c — 0. We find 


© ,5—1,—ar 
r 


lim I &9- [ = 
c0 i os Le 


dr = T(s)e(s, a), 


if c > 1. We show next that lim... I;(s, c) = 0. To do this note that g(z) 
is analytic in |z| < 27 except for a first order pole at z — 0. Therefore zg(z) 
is analytic everywhere inside |z| < 2x and hence is bounded there, say 
lg(z)) < A/Iz|, where |z| = c < 2z and A is a constant. Therefore we have 


Ux, c) < sf eA ag < Aele, 
c 


-z 


If c » 1 and c 5 0 we find T,(s,c) + 0 hence nl(s, a) = sin(zs)L'(s)/(s, a). 
Since I'(s)T(1 — s) = z/sin zs this proves (6). 


12.5 The analytic continuation of the 
Hurwitz zeta function 


In the equation ((s, a) = I(1 — s)I(s, a), valid for c > 1, the functions 
I(s, a) and T(1 — s) are meaningful for every complex s. Therefore we can use 
this equation to define Qs, a) foro < 1. 


Definition If ¢ < 1 we define Us, a) by the equation 


(7) Us, a) = T(1 — s)I(s, a). 
This equation provides the anal 


lytic continuation of Us, a) in the entire 
s-plane. 
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12.6: Analytic continuation of ¢(s) and L(s, y) 


Theorem 12.4 The function ((s, a) so defined is analytic for all s except for a 
simple pole at s — 1 with residue 1. 


PROOF. Since I(s, a) is entire the only possible singularities of C(s, a) are the 
poles of I'(1 — s), that is, the points s = 1, 2, 3, ... But Theorem 12.1 shows 
that C(s, a) is analytic at s = 2, 3,...,so s = 1 is the only possible pole of 
Us, a). 

Now we show that there is a pole at s = 1 with residue 1. If s is any integer, 
say s = n, the integrand in the contour integral for I(s, a) takes the same 
values on C, as on C, and hence the integrals along C, and C, cancel, 
leaving 


Dr zu zie 
=, dz = Res —_.. 
Tin al xi] Sey Fase 
In particular when s — 1 we have 
e? ze” J z . -1 
Hp = = lim = lim = lim — = -1 
(1, a) Re ET r0 LE zol — 20,.€ 


To find the residue of ((s, a) at s = 1 we compute the limit 
lim(s — 1){(s, a) = — lim(1 — sST(1 — s)I(s, a) = —I(1, m IQ — s) 
31 sol s 


- I(1)- 1. 
This proves that ¢(s, a) has a simple pole at s = 1 with residue 1. Dn 


Note. Since ¢(s, a) is analytic at s = 2, 3,... and T(1 — s) has poles at these 
Points, Equation (7) implies that J(s, a) vanishes at these points. 


12.6 Analytic continuation of (s) and L(, x) 
In the introduction we proved that for ø > 1 we have 
Cs) = (s, 1) 


and 
(8) Ls) =k ors r), 


where y is any Dirichlet character mod k. Now we use these formulas as 
definitions of the functions t(s) and L(s, x) fore < 1. In this way we obtain the 
analytic continuation of {(s) and L(s, x) beyond the line ø = 1. 


Theorem 12.5 (a) The Riemann zeta function (s) is analytic everywhere 


except for a simple pole at s = 1 with residue iB 1 . 
(b) For the principal character y; mod k, the L-function L(s, x,) is analytic 


everywhere except for a simple pole at s = 1 with residue q(k)/k. 
(c) I«fy X1; Lís, X) is an entire function of s. 
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Proor. Part (a) follows at once from Theorem 12.4. To prove (b) and (c) 
we use the relation 


[U if x x x; 
,24,,/07 Na itx - zn. 


Since C(s, r/k) has a simple pole at s = 1 with residue 1, the function x(r){(s, r/k) 
has a simple pole at s = 1 with residue y(r). Therefore 


k 
Res L(s, x) = lim(s — 1)L(s, y) = lim(s — Dk? 2 e i) 
s=1 so sol r= 


Ain 0  ify£i 
=7 LAN=joh . 
p» E ify-—x. 


12.7 Hurwitz's formula for ( (s, a) 


The function (s, a) was originally defined for ø > 1 by an infinite series. 
Hurwitz obtained another series representation for ((s, a) valid in the half- 


planec « 0. Before we state this formula we discuss a lemma that will be used 
in its proof. 


Lemma 1 Let S(r) denote the region that remains when we remove from the 
z-plane all open circular disks of radius r, 0 < r < 7, with centers at 
z = 2nni,n = 0, +1, £2,...Thenifü <a < 1 the function 


vat 
g(z) = TEST: 


is bounded in S(r). (The bound depends on r.) 
Pnoor. Write z = x + iy and consider the punctured rectangle 


Or) = {z: |x] < 1, |y] <x, |z| > r}, 
shown in Figure 12.2. 


Figure 12.2 
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This is a compact set so g is bounded on Q(r). Also, since |g(z + 2zi)| = 
|g(z)|, g is bounded in the punctured infinite strip r 


{z:|x| < 1, |z — 2nni| > r,n = 0, +1, +2,...}. 


Now we show that g is bounded outside this strip. Suppose |x| > 1 and 
consider 


- E: z^ 
Igel is^ Sel lel 


For x > 1 we have |l — e*| = e* — 1 and e* < e*, so 
1 1 e 


legi S p eq e-1° 


Also, when x < —1 we have |1 — e*| = 1 — e* so 


leal « — < $2 1 mil set? ons 
l-—e 1-e 1-e e—1 


Therefore |g(z)| < e/(e — 1) for |x| z 1 and the proof of the lemma is 
complete. O 


We turn now to Hurwitz’s formula. This involves another Dirichlet 

series F(x, s) given by 
© p2xinx 

(9) F(x, s) = 2 
where x is real and ø > 1. Note that F(x, s) is a periodic function of x with 
period 1 and that F(1, s) = (s). The series converges absolutely if c > 1. 
If x is not an integer the series also converges (conditionally) for c > 0 
because for each fixed nonintegral x the coefficients have bounded partial 


sums. 


2 


n 


Note. We shall refer to F(x, s) as the periodic zeta function. 
Theorem 12.6 Hurwitz's formula. If 0 < a < 1 and o > 1 we have 


IS) o ig nis/2 

10 1 = 5,4) —-—— (e *"? F(a; s)-E e" F(—aj3)]- 
(10) (1 — s, a) On} e ( 

Ifa # 1 this representation is also valid for o > 0. 


PRoor. Consider the function 


s—1 az 


1 zoe 
Iy(s, a) = Sé tee 


where C(N) is the contour shown in Figure 12.3, N being an integer. 


dz, 
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o (2N * 2)ni 
e0 R=(QN+1)0 


Figure 12.3 


First we prove that limy... Ix(s, a) = I(s, a) if o < 0. For this it suffices 
to show that the integral along the outer circle tends to 0 as N > oo. 
On the outer circle we have z = Re”, —z < 0 < n, hence 
jz, = [Rr 1e%- 0| = RO- te~"? < RIT ter, 


Since the outer circle lies in the set S(r) of Lemma 1, the integrand is bounded 
by Ae"! R*7!, where A is the bound for |g(z)| implied by Lemma 1; hence 
the integral is bounded by 
2nAe™IR?, 
and this ^ 0 as R — oo if ø < 0. Therefore, replacing s by 1 — s we see that 
(11) lim I,(1 — sa) = I1—5,a) ifo>1. 
No 


Now we compute Iy(1 — s, a) explicitly by Cauchy’s residue theorem. We 
have 


N N 
I(1—sa)2— Y R(n) =- 2, (Rt) + R(—n)} 


n=—-N 
n#0 
where 
R(n) = Res (5) 
z—2nxi Le à 
Now 
? -se e2nnia A 2nni 2nria 
Rr anima) (Cees re E a EE 
2 2nzi 1—e Qnüif..1-—e (anziy " 


12.8: The functional equation for the Riemann zeta function 


hence 


2nixia N  g-2nnia 


N 
I(l — s, a) = p Quay * p (-2nuij' 


But i-* = e^"? and (— i) * = e**? so 


e 553 N e2nnia erisl2 N e 2nnia 


I-sa) = Gor ig tg w 


Letting N — oo and using (11) we obtain 
—nis/2 gris 


e e Quim ] 
I(1 — s,a) = wy s)+ On F(—a, s). 


Hence 


(1 — s, a) = T()I(1 — s, a) = ay (e^ "57 E(a, s) -- e*S2F(—a,s)}. O 


12.8 The functional equation for the 
Riemann zeta function 


The first application of Heswitz's formula is Riemann’s functional equation 
for C(s). 


Theorem 12.7 For all s we have 


(12) ga- s) = a2) "reo (ar o 

or, equivalently, 

(13) t(s) = 20xy-?T(1 — ssin( ca - s. 
Proor. Taking a = 1 in the Hurwitz formula we obtain, for c > 1, 

AL TG) p -ristis nis/2, = I um 
Q(1—s)- Tn” e t(s) + e™ Cs) xy 2 cos| 3 t(s). 

This proves (12) for ¢ > 1 and the result holds for all s by analytic continua- 
tion. To deduce (13) from (12) replace s by 1 — s. o 


Note. Taking s = 2n + 1 in (12) where n = 1,2, 3, .. ., the factor cos(xs/2) 
vanishes and we find the so-called trivial zeros of (s), 


t(—2n)=0 forn= 1, 2, 3, .-. 
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The functional equation can be put in a simpler form if we use Legendre's 
duplication formula for the gamma function, 


1 
2n'/?2-25T (2s) = ror(: + j 


which is the special case m — 2 of Equation (4). When s is replaced by 
(1 — s)/2 this becomes 


Pap — s) = ( i y = j 


Since 


this gives us 


2^75n zum p 
Mi ~ s)sin > = 


n5 


Using this to replace the product (1 — s)sin(zs/2) in (13) we obtain 


v jin = c E ‘a -3 


In other words, the functional equation takes the form 
®(s) = (1 — s), 
where 
d) = erako 


The function ®(s) has simple poles at s = 


Oands = 1. Following Riemann, 
we multiply d(s) by s(s — 1)/2 to remove thi 


e poles and define 
jl 
&(s) = 5 (5 — De). 
Then &(s) is an entire function of s and satisfies the functional equation 


5) = (1 — s). 
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12.9 A functional equation for the Hurwitz 

zeta function 
The functional equation for {(s) is a special case of a functional equation 
for ¢(s, a) when a is rational. 


Theorem 12.8 If h and k are integers, 1 < h < k, then for all s we have 


h (s) & zs 2mhV r 
95 (i -si-a BG k Xs i) 


Proor. This comes from the fact that the function F(x, s) is a linear combina- 
tion of Hurwitz zeta functions when x is rational. In fact, if x = h/k we can 
rearrange the terms in (9) according to the residue classes mod k by writing 


n=qk+r, where 1 < r < kand q = 0,1,2,... 


This gives us, for ø > 1, 


F h zm g2minh|k k œ k 5 m $ 
ee) S = = gii 
( i s) L n 2 A, (qk + pDn e i q-0 3 


k : r 
e enis. r). 
Therefore if we take a = h/k in Hurwitz’s formula we obtain 


h Ts) & —nis/2,,2nirh/k gis[2,,, — 2nirh/k ( A ) 
—s-j- : + ere Us 
(i » 3 quy t Na 


=H $ (8-22 
= O S cos $ E Ax» 


=1 


which proves (14) fora > 1. The result holds for all s by analytic continuation. 
O 


It should be noted that when h = k = 1 thereis only one term in the sum in 
(14) and we obtain Riemann’s functional equation. 


12.10 The functional equation for L-functions 


Hurwitz’ formula can also be used to deduce a functional equation for the 
irichlet L-functions. First we show that it suffices to consider only the 
Primitive characters mod k. 
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Theorem 12.9 Let x be any Dirichlet character mod k, let d be any induced 
modulus, and write 


xn) = v) 


where W is a character mod d and y, is the principal character mod k. Then 
for all s we have 


) 
1.2 = us pr - 8. 
plk p 
Proor. First keep ø > 1 and use the Euler product 


1 
His Deng 
p 
Since x(p) = W(p)x,(p) and since y,(p) = 0 if p|k and x,(p) = 1 if p J k we 
find 


PE Ee q = pk p 
p 


= LG.) TT ( E w), 


pik 


w a D gg I - 5) 
P 


This proves the theorem for o > 1 and we extend it to all s by analytic 
continuation. o 


Note. If we choose d in the foregoing theorem to be the conductor of » 
then y is a primitive character modulo d. This shows that every L-series 
L(s, y) is equal to the L-series L(s, V) of a primitive character, multiplied 
by a finite number of factors. 


To deduce the functional equation for L-functions from Hurwitz's 
formula we first express L(s, y) in terms of the periodic zeta function F(x, s). 


Theorem 12.10 Let y bea primitive character mod k. Then for a > 1 we have 
x E h 
(15) G(1, )LG, x) = Y x(h)F G, s), 
h=1 
where G(m, x) is the Gauss sum associated with » 


k 
G(m, x) = Y y(r)e?"imn. 
r=1 
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Proor. Take x = h/k in (9), multiply by x(h) and sum on h to obtain 


k 
Ys. ;- y » (e? n7: = = v z(h)e2rinhk 


h-1n-1 
- y n^*G(n, 3). 
n=1 
But G(n, 3) is separable because ; is primitive, so G(n, 3) = x(n)G(1, 3), hence 


k o 
» HF s) = GUL, D Y xen^* = GU, DLs, 2). n 
h=1 n=1 


Theorem 12.11 Functional equation for Dirichlet L-functions. If y is any 
primitive character mod k then for all s we have 


469 L(l-s,x= oe g "5 4. y(—1e™}G(1, x)L(, 3). 


Pnoor. We take x = h/k in Hurwitz's formula then multiply each member 
by x(h) and sum on h. This gives us 


1 h\ _ TO) fus y rlt 
DE =s, j- P» Hf PX 


n —h 
4er y ed 2 
h=1 k 


Since F(x, s) is periodic in x with period 1 and x(h) = x(—1)x(—h) we can 
write 


= -h 
» an(s) x-9 È icons) 
hmodk hmodk 
k—h 
-q-0 E s- nr( , D 
hmodk 
h 
- x) Y nr...) 
hmod k 
and the previous formula becomes 


k h T(S) , -«is2 — peris nn; ) 
Xe - 92) = gem em È nrt s 


Now we multiply both members by i57! and use (15) to obtain (16). m) 
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12.11 Evaluation of €(—n, a) 


The value of {(—n, a) can be calculated explicitly if n is a nonnegative integer. 


Taking s = —n in the relation ¢(s, a) = T(1 — s)I(s, a) we find 


k-n, a) = T(1 + n(—n,a) = n!I(—n, a). 


We also have 


I(—n, a) = Res (=). 


zoMl-e 
The calculation of this residue leads to an interesting class of functions known 
as Bernoulli polynomials. 


Definition For any complex x we define the functions B,(x) by the equation 


z", where |z| « 2z. 


n=o n! 


The numbers B, 


(0) are called Bernoulli numbers and are denoted by B,. 
Thus, 


B, 


t 
Sp" Yu? where |z| < 2z. 
azon! 


Theorem 12.12 The functions B,(x) are polynomials in x given by 


B,(x) = E (ner 


k=0 
PRoor. We have 


eo 


B(x), z xo S Bea ex, 
DEn ae XE ze) 


from which the theorem follows. 


Theorem 12.13 For every integer n 7 0 we have 


(17) (=n, d)zs B,+1(a) 
ntl 
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Proor. As noted earlier, we have ((—n, a) = n!I(—n, a). Now 


z7r- lea AN E 
1csa = Ree) = ten 


egg eg Bd a B,+1(@) 
AN ei: Eus )- -arD 


z=0 m=0 


from which we obtain (17). a 


12.12 Properties of Bernoulli numbers and 
Bernoulli polynomials 


Theorem 12.14 The Bernoulli polynomials B. (x) satisfy the difference equation 


(18) B(x + 1) — B(x) = nx"! ifnzl 
Therefore we have 
(19) B,(0) = B,(1) ifn 2 2. 
Proor. We have the identity 
(x 1)= ee 
if pif amu? 


from which we find 


Bx U= B,(x) EL 5 X ql 
n-0 


p n! 
Equating coefficients of z" we obtain (18). Taking x — Oin (18) we obtain (19). 


Theorem 12.15 If n > 2 we have 


B, = DH 


,PRoor. This follows by taking x — 1 in Theorem 12.12 and using(19. O 


nformula for computing Bernoulli numbers. 


Theorem 12.15 givesa recursio: t e 1 
d Theorem 12.15 yields in succession the 


The definition gives Bg = 1, an 


values 
1 1 
By =1, B,=—5, Bi B-0 B= uy 
1 
B, — 0, B. 75> B; = 0, Bs = — 39° B, = 0, 
5 
Bio = gg: B,-0 
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From a knowledge of the B, we can compute the polynomials B,(x) by using 
Theorem 12.12. The first few are: 


1 1 
Bo(x) = 1, By(x) = x — PE B(x) =x? — x + H 
[oHm Oe Bax) = x* — 2x3 4 2b 
3 2 2 3 4 30° 
We observe that Theorems 12.12 and 12.15 can be written symbolically 
as follows: 


B(x) = (B + x}, B, = (B + 1}. 


In these symbolic formulas the right members are to be expanded by the 
binomial theorem, then each power B* is to be replaced by B,. 


Theorem 12.16 If n > 0 we have 
cca 
Q0) Sn) = - Ter. 


Also, if n 2 1 we have ¢(—2n) = 0, hence Bons = 0. 


Proor. To evaluate &(—n) we simply take a = 1 in Theorem 12.13. We have 
already noted that the functional equation 


(21) (1—s)= 228)" s}09( Nts 
implies /(—2n) = 0 for n > 1, hence Bons 1 = 0 by (20). o 
Note. The result B;, , 1 


= O also follows by noting that the left member of 


z 1 S Ban 
e ty oe 


is an even function of z. 
Theorem 12.17 If k is a positive integer we have 
(27)?*B. 
22 2k) = (— 1*1 KT B, 
(22) (2k) = (— 1} OT 


Pnoor. We take s = 2k in the functional equation for t(s) to obtain 


(1 — 2k) = 2(2n)-?*r Qk)cos(nk)t(2k), 
or 
- Bu = 2(2n)- 2 2k 1 
e (2k — 1) (— DKK). 
This implies (22). 
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Note. If we put s = 2k + 1 in (21) both members vanish and we get no 
information about (2k + 1). As yet no simple formula analogous to (22) 
is known for C(2k + 1) or even for any special case such as ((3). It is not even 
known whether Z(2k + 1) is rational or irrational for any k. 


Theorem 12.18 The Bernoulli numbers B5, alternate in sign. That is, 
(—1)*1B,, > 0. 
Moreover, | B3,| > oo as k > oo. In fact 


20k)! 


om as k oo. 


(23) (—1**!B5, ~ 


PROOF. Since (2k) > 0, (22) shows that the numbers B,, alternate in sign. 
The asymptotic relation (23) follows from the fact that (2k) > 1 as k + oo. 


Note. From (23) it follows that |B,+2/B2,| ~ k?/n? as k > œ. Also, by 
invoking Stirling’s formula, n! ~ (n/e)"\/22n we find 


k 2k*1/2 
(1) Ba ~ eA) as k + co. 


The next theorem gives the Fourier expansion of the polynomial B,(x) 
in the interval 0 < x < 1. 


Theorem 12.19 /f0 < x < 1 we have 


n! +o e2nikx 
(24) B,(x) Lj 2m Qi NA p 
k#0 


and hence 
2(2n)! & cos 2nkx 


23 k?” a 


(2n)?" k=1 


Bjx) = (—1)"** 


2(2n + 1)! & sin 2xkx 
Boner) = (71^ expen 2, [pa 


PROOF, Equation (24) follows at once by taking s = n in Hurwitz’s formula 
and applying Theorem 12.13. The other two formulas are special cases of 


(24) 


Note. The function B,(x) defined for all real x by the right member of 
(24) is called the nth Bernoulli periodic function. It is periodic with period 1 
and agrees with the Bernoulli polynomial B,(x) in the interval 0 < x < 1. 

hus we have 
Bx) = B,(x — Lx). 
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12.13 Formulas for L(0, y) 
Theorem 12.13 implies 


1 
(0, a) = —By(a) = ; —a. 


In particular (0) = (0, 1) = — 1/2. We can also calculate L(0, x) for every 
Dirichlet character y. 


Theorem 12.20 Let y be any Dirichlet character mod k. 
(a) If x = x (the principal character), then L(0, 21) 2 0. 
(b) If x # xı we have 


1 k 
L0, 3) — —r Yr) 
r=1 
Moreover, L(0, y) = 0 if y(— 1) = 1. 


Proor. If  — y, we use the formula 


L{s, xı) = Us) [TQ — p75) 
pik 


proved for c > 1 in Chapter 11. This also holds for all s by analytic continua- 
tion. When s = 0 the product vanishes so L(0, x,) = 0. 
If y # y, we have 


z r k joy | 
L(0, x) = Lanr(o, i) = à = z) —$ in. 
Now 


k k k k 
Yne)-Yk-n-n-kyxk-»- È rx(—r) 
r=1 r=1 r=1 rat 


k 
= —7(-1) 2 rir). 


Therefore if «( —1) = 1 we have Yt arx) = 0. 


12.14 Approximation of C(s, a) by finite sums 


Some applications require estimates on the rate of growth of (c + it, a) asa 
function of t. These will be ded 


t uced from another representation of ¢(s, a) 
obtained from Euler’s summation formula. This relates Us, a) to the partial 
sums of its series in the half-pl: 


À ane c > 0 and also gives an alternate way to 
extend ¢(s, a) analytically beyond the line g zx. 
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Theorem 12.21 For any integer N > 0 and o > 0 we have 


1 (N + a) e x — [x] 


pa 


N 
Q3) &sa)= g 


aco (n + af s—l 
Proor. We apply Euler's summation formula (Theorem 3.1) with f(t) = 
(t + a)~* and with integers x and y to obtain 
1 p wi * t— [t] 
-s i 
dii. (n + af [ (t + ay y Cay! 
Take y = N and let x > co, keeping o > 1. This gives us 
= 1 2 dt * t—[t] 
-s A 
chin (n+a Jy (t+ ay w (t+ ay"? 


1 JWtore (Pts 


N 
td gag st “wee 
This proves (25) for o > 1. If o > 5>0 the integral is dominated by 
f$ (t + a)-?-! dt so it converges uniformly for ø = ô and hence represents 
an analytic function in the half-plane c > 0. Therefore (25) holds for c > 0 
by analytic continuation. 


The integral on the right of (25) can also be written as a series. We split 
the integral into a sum of integrals in which [x] is constant, say [x] = n, 
and we obtain 

oo 1 
oo x— [x] LJ ntl x—n d Í u s 
FE y y = — zr dx = ore ET u. 
f. (x + ay ™ X e epa) p o (u t n ay** 


Therefore (25) can also be written in the form 


NN (N aes EH ee S 
e a 2 (n F a e i (utn+a)** d 


c ay s—1 
ifo > 0. Integration by parts leads to similar representations in successively 
larger half-planes, as indicated in the next theorem. 
Theorem 12.22 If o > —1 we have 
3 N -a)^7 
Q7)  ((sa)— Y, I sps (Nen ) 


ont  s-l 


s N 1 
x fes +1,a)— Pa (n 4 anl 


¢41) 2 [ ut 
SUC p eee 
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More generally, if o > —m, where m = 1,2, 3,..., we have 
x 1 (N c a)!^5 e s(s + 1)---(s+r— 1) 
VR) oct) X (nap —s-1 à (r4 1)! 


N 1 
x {ie + na) = 22: es" 


n=o (n+ ay** 


_ ss + D): (m) 
(m + 1)! 

LJ 1 unt! 
L——— uu du. 
Aenean 

Proor. Integration by parts implies 
Í u du u? a, Sek 1 u? du 
(ntatuy*? 2n ad uy 2 (n+a+ uft?’ 


so if ø > 0 we have 


aout u du bos 1 
à f, (n a uj?! 2. (n - a 4 1j! 
s+12 f u? du 
o (n ac ut? 
But if o > 0 the first sum on the right is ((s + 1, a) — No (n + a)*7! and 
(26) implies (27). The result is also valid foro > —1 by analytic continuation. 
By repeated integration by parts we obtain the more general representation 


in (28). o 


t3 


12.15 Inequalities for |C(s, a)| 
The formulas in the foregoing section yield upper bounds for |{(¢ + it, a)| 
as a function of t. 
Theorem 12.23 (a) If 6 > 0 we have 
(29) I{(s,a)—a~*| x (1--à) ifo>1 +ô. 


(b) If0 < ô <1 there isa positive constant A(5), depending on 6 but nd on 
s or a, such that 


G0) |&(s, a) —a7*| < A(d)|e/° f1 -<c x<2and]t|>1, 
G1) 1&(s, a) — a7*| < A(d)|e [+8 if -ó € c <5dand|t|>1, 
G2) |&s a)| < AQ)rm* 1*2 if -n—-ósocx« 
where m = 1,2,3,... 


—m + ó and |t| > 1, 
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Proor. For part (a) we use the defining series for C(s, a) to obtain 


eo 


Is, à) —a^| x Y 


Es 
— X 
n=i(n + a)” 2, n 


which implies (29). 
For part (b) we use the representation in (25) when 1 — ô < o < 2 to 
obtain 


ao ME MM (N  a)* om Mx 
sacs Y ort uoa tU gar 


N T= 
«1 f de (S chat 1s H (N +a™. 


1 (x +a)’ |s— 1 


Since c > 1 — ô > 0 we have (x + a)’ > (x + a)! ^? > x!^*so 
N dg N dx N? 
Í (x + ay Xen! © Oe 
Also, since |s — 1| = |ø — 1 + it| > |t| > 1 we have 
1-c 
RAO Sw tal sin + oF 
ae 
Finally, since |s| < |a| + |t| < 2 + |t| we find 
E MENU ud * nk 
Elin a 4g WI se P 


These give us 


N? 24 |t N? 
la —a7] «1 6 EON E D'E T3 NC 


Now take N = 1 + [|t|]. Then the last three terms are O(\t|°), where the 
constant implied by the O-symbol depends only on 6. This proves (30). 
To prove (31) we use the representation in (27). This gives us 


N Nay 1 -s- 
Kea- ais Yo T + 3l + D, a) — a=} 


igy cliam PUR se: 11'S e 
alld aaa +2 d nay 


As in the proof of (30) we take N = 1 + [|t|] so that N = O(|t|) and we 
Show that each term on the right is O(|¢|'**), where the constant implied 
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by the O-symbol depends only on ô. The inequalities —ó < o < ô imply 
1—5<1—o0<1+6,hence 


Ne iyi Wa idx (N tae 
— <1 1 
2 mra +] Gap t To 
1*6 
«14 CED ogg, 


is 1-6 
Since |s — 1| > |t| > 1.the second term is also O(|t|' *?). For the third term 


we use (30), noting that 1 — ô < c + 1 € 1 + ô and |s| = O(|t|), and we 
find that this term is also O(|t|' *^). Next, we have 


N 1 N dx 
lY Gat oft Po 3) 
= O(ItIN75) = Olti = oq: 99). 


Finally, 


Isls + 1È ce = oi [7 ES O(|t PN777! 
arae PE Jy gegen) = EN) 
= (IAN?) = o(pt*9) 


This completes the proof of (31). 


The proof of (32) is similar, except that we use Q8)and notethata^* = O(1) 
when c « 0. 


12.16 Inequalities for |Z(s)| and |L(s, y)| 


When a = 1 the estimates in Theorem 12.23 give corresponding estimates 
for | {(s)|. They also lead to bounds for Dirichlet L-series. Ifo > 1 + 5, where 


ô > 0, both |¢(s)| and |L(s, z)| are dominated by ¢(1 + ô) so we consider 
only c < 1 + ô. 


Theorem 12.24 Let y be any Dirichlet character mod k and assume 0 < ô < 1. 


Then there is a positive constant A(6), depending on 6 but not on s or k, 
such that for s = o + it with |t| > 1 we have 


63  ILG Is AG)aP*'** iff2m-ósoz ma ô, 


where m = —1,0, 1:09: 


Pnoor. We recall the relation 


k-1 
Ls 2) = k"*y nos, r) 
r=1 k 
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If m = 1, 2, 3,... we use (32) to obtain 


T 


which proves (33) for m > 1. If m = 0 or —1 we write 


k-1 k-1 - 
ET 
(34) Esa "d k l0 qs É i 


r=1 


k-1 


IL(s, 3)] € k^* Y, 


r=1 


< kt Ad) [tries 


-Since —m — ô < e € —m + 6 we can use (30) and (31) to obtain 


t- 


so the second sum in (34) is dominated by A(6)|kt|"*1**. The first sum is 
dominated by 


Kn? < kn**4(5)|r|n* 1*5, 


k-1 1 k-1 ? k - pnt its fmt its 
ez m m E a_i < —, 
PDEEEX, eis f dx m+1+6 ó 
and this sum can also be absorbed in the estimate A(ó)| kt|"* ! **. D 


Exercises for Chapter 12 
1. Let f(n) be an arithmetical function which is periodic modulo k. 
(a) Prove that the Dirichlet series Y; f(n)n~* converges absolutely foro > land that 
oo k 
fe ex rods r) ifo>1. 
ami" r=1 k 


(b) If Y, f(r) = 0 prove that the Dirichlet series X San converges foro >0 
and that there is an entire function F(s) such that F(s) = Y. f(n)n^* for o > 0. 


2. If x is real and c > 1, let F(x, s) denote the periodic zeta function, 


GJ ern 
Fos) = Da 
If0 < a < 1 and c > 1 prove that Hurwitz's formula implies 
F(a,s) = T(1 — s) qeu- — s a) + gris- DAE — s 1 — a). 
(27)! =s 


3. The formula in Exercise 2 can be used to extend the definition of F(a, s) over the 
entire s-plane if0 < a < 1. Prove that F(a, s), so extended, is an entire function of s. 


4. If0<a< Land 0 <b < 1 let 


T(s) 


(t(s, a)F(b, 1 + s) + Us, 1 — a)F(1 — b, 1 + s)}, 
Qzy 


Qa, b, s) = 
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where F is the function in Exercise 2. Prove that 


Oa,b,s) sis Lia 1- al-s, b 
STE 12 (Es, alf(—s, 1 — b) + Us, 1 — a)t(—s, b) 


eT (s, 1 — b)(s, 1 — a) + (— s, bs a)}, 
and deduce that ®(a, b; s) = ®(1 — b, a, —s). This functional equation is useful in 
the theory of elliptic modular functions. 


In Exercises 5, 6 and 7, č(s) denotes the entire function introduced in Section 
12.8, 


&s) = 5a = ner(Sko. 


5. Prove that (s) is real on the lines t = 0 and o = 1/2, and that £(0) = &(1) = 1/2. 


6. Prove that the zeros of £(s) (if any exist) are all situated in the strip0 < ø < 1andlie 
symmetrically about the lines t = 0 and o = 1/2. 


7. Show that the zeros of ¢(s) in the critical strip 0 < ø < 1 (if any exist) are identical 
in position and order of multiplicity with those of é(s). 


8. Let y be a primitive character mod k. Define 


oa f0 ifx-1)=1, 
a= ag)={' ifx-1)- —L 


(a) Show that the functional equation for L(s, y) has the form 


ms — a) 


L(1 — s, 3) = e(x)2Qx)-*ke7* ex Jr. 2), where |e(y)| = 1. 


(s+a)/2 
as. = (5 (f : jus 9. 


Show that &(1 — s, 3) = ex)&(s, x). 


9. Refer to Exercise 8. 


(b) Let 


(a) Prove that ¢(s, y) € O ifa > lore <0. 
(b) Describe the location of the zeros of L(s, y) in the half-plane c < 0. 


10. Let y be a nonprimitive character modulo k. Describe the location of the zeros of | 
L(s, x) in the half-plane ø < 0. | 


11. Prove that the Bernoulli polynomials satisfy the relations 
B,{1 — x) = (—1)"B,(x) and B,,, (3) = 0 for every n > 0. 
12. Let B, denote the nth Bernoulli number. Note that 


B,=¢=1-4-4 By=3h=1-4-4-4, 
Be =a =1-4-4-3, 
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13, 
14, 


15, 


16. 


17. 


Exercises for Chapter 12 


These formulas illustrate a theorem d'^covered in 1840 by von Staudt and Clausen 
(independently). If n 2 1 we have 


1 
Ba =l, + X + 
a 


where I, is an integer and the sum is over all primes p such that p — 1 divides 2n. 
This exercise outlines a proof due to Lucas. 


(a) Prove that 
n k 
-brail 


[Hint: Write x = log{1 + (e* — 1)} and use the power series for x/(e* — 1).] 
(b) Prove that 


a sy ks 
= E c(n, k), 


where c(n, k) is an integer. 

(c) If a, b are integers with a > 2, b > 2 and ab > 4, prove that ab|(ab — 1)!. This 
shows that in the sum of part (b), every term with k + 1 composite, k > 3, is an 
integer. 

(d) If p is prime, prove that 


gr ME AS —1(mod p) ifp— l|n n > 0, 
Sf r ref 0 (mod p) ifp — 1 n. 


(e) Use the above results or some other method to prove the von Staudt-Clausen 


theorem. ; i | 
Prove that the derivative of the Bernoulli polynomial B,(x) is nB,- ,(x) if n 2 2. 


Prove that the Bernoulli polynomials satisfy the addition formula 


Bjxy)u Y (aen ^ 


k=0 


Prove that the Bernoulli polynomials satisfy the multiplication formula 


m-1 k 
B,(mx) = v n + JI 


Prove that ifr > 1 the Bernoulli numbers satisfy the relation 


r 2*B,, £r 


D (2k)!(2r + 1 — 2k)! (2r)!" 


Calculate the integral få xB,(x) dx in two ways and deduce the formula 
z B, - By 
D p+2-r p+! k 
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E 


18. (a) Verify the identity 
" uv ft T uv ( 5 P: ree 1 ) 
(e — 1)(e — 1) u+v u +v e—1 e-—1 


L3 ea) pai 
EE: (ILL B,. 


n=2 Nn! u+ov 


(b) Let J = J} B,(x)B,(x) dx. Show that J is the coefficient of p!q!u"v* in the 
expansion of part (a). Use this to deduce that 


p!q! 


Do i i Bire ifpzlqzl 
1 
Í, B,(x)B,{x) dx = fore 0! 
0 ifp > 1,q=0;orp=0,q>1. 


19. (a) Use a method similar to that in Exercise 18 to derive the identity 


2r, 


CEOD È BBO = ix Y B.) 07 


m=0 n=0 m=0 n=0 m!n! Xo» 


2r), 


(b) Compare coefficients in (a) and integrate the result to obtain the formula 


B2Bm+n- n may min! 
"^ ga xc ys (RB mis-m Pho} ‘Wat mi Pn 


form > 1,n > 1. Indicate the range of the index r. 


20. Show that if m > 1, n > 1 and p = 1, we have 


1 
[ B«984608,6) dx 
o 


eas ea E a d (m 4 n — 2r — 1)! 
did PG ae (mE n p — 27) Oa Prnt- 


In particular, compute f? B;*(x) dx from this formula. 


21. Let f (n) be an arithmetical function which is periodic mod k, and let 


atr) =~ femer 


mmodk 


denote the finite Fourier coefficients of yat 


ty 
F(s) =k". nos i) 
r=1 


prove that 


T(s) jas iK k 
Fa1-s)- er jew Y ares, ‘) decre yq nfs i 
PN r=1 
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22. Let y be any nonprincipal character mod k and let S(x) = <x xin). 


(a) If N > 1 and g > 0 prove that 
z x(n) © S(x) — S(N) 
Ls, x) = x = +s Í ETE 
(b) If s = c + it with o > ô > 0 and |t| > 0, use part (a) to show that there is a 
constant A() such that 
ILs, 3)] < ABK (IEI + D'7* 


per bound for |S(x)|. In Theorem 13.15 it is shown that 


dx. 


where B(k) is an up 
B(k) = O(/k log k). 


(c) Prove that for some constant A > 0 we have 
s 1 " 
Ls) € Alogk ifo zi egt ds It] < 2. 


[Hint: Take N — k in part (a).] 
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Analytic Proof of the 
Prime Number Theorem 


13.1 The plan of the proof 
The prime number theorem is equivalent to the statement 
(1) V(X)-7 x asx oo, 
where f(x) is Chebyshev's function, 
U(x) = PELO 


This chapter gives an analytic proof of (1) based on properties of the Riemann 
zeta function. The analytic proof is shorter than the elementary proof 
sketched in Chapter 4 and its principal ideas are easier to comprehend. 
This section outlines the main features of the proof. 

The function y is a step function and it is more convenient to deal with its 
integral, which we denote by W,. Thus, we consider 


vats) = [woa 


The integral y, is a continuous piecewise linear function. We show first that 
the asymptotic relation 


1 
(2) Wile) ~ zx! asx- oo 
implies (1) and then 


L prove (2). For this purpose we express W,(x)/x? in terms 
of the Riemann zeta 


function by means of a contour integral, 


Wi) _ 1 |o xu ( UG) 


xn Da ETHER - $2) a, where c > 1. 
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The quotient — C'(s)/C(s) has a first order pole at s — 1 with residue 1. If we 
subtract this pole we get the formula 


y(x) 1 N 1 toi yxs-l (-i8- 1 ) i 
x? -3(1 J Er aw m 


We let 


1 coe 1 ) 
MS) = Ie ak Q) s-1 
and rewrite the last equation in the form 


E hel I ( -i m TU 7I) ds 


x 2 x 2ni Je oi 


x^! +o " 
-—— I h(c + it)e'* !8* dt. 
Di) — 


To complete the proof we are required to show that 


-1 +o 5 
(4) lim — f Kle + ite" 8* dt = 0. 


x70 


Now the Riemann-Lebesgue lemma in the theory of Fourier series states that 


+o 
lim i f (De dt = 0 
xo -0 
converges. The integral in (4) is of this type, with x 
replaced by log x, and we can easily show that the integral ft2 Ihle + it)] dt 
converges if c > 1, so the integral in (4) tends to 0 as x > oo. However, the 
factor x*- ! outside the integral tends to 00 whenc > 1,so weare faced with an 
indeterminate form, oo - 0. Equation (3) holds for every c > 1. If we could 
put c — 1 in (3) the troublesome factor X^. would disappear. But then 
h(c + it) becomes h(1 + it) and the integrand involves [4 (N/E) on the line 
6 = 1. Inthiscaseit is more difficult to prove that the integral (*3 |A(1 ^s it)| dt 
converges, a fact which needs to be verified before we can apply the Riemann- 
Lebesgue lemma. The last and most difficult part of the proof E to show that 
it is possible to replace c by 1 in (3) and that the integral ffs |n + it)| dt 
converges. This requires a more detailed study of the Riemann zeta function 
in the vicinity of the line ø = 1. 

Now we proceed to carry out t 
some lemmas. 


if the integral f+% | f(t)| dt 


he plan outlined above. We begin with 


13.2 Lemmas 


Lemma 1 For any arithmetical function a(n) let 
Ax) = È aln), 
nsx 


279 


13: Analytic proof of the prime number theorem 


where A(x) = O ifx < 1. Then 


= "(aga 
(5) Eo- nan = [ana 
Proor. We apply Abel’s identity (Theorem 4.2) which states that 
(6) E aln) f(n) = Av Feo) — f AQf'0 dt 
nsx 


if f has a continuous derivative on [1, x]. Taking f(t) = t we have 


Laln)f(n) = P nan) and A(x) f(x) = x ¥ a(n) 
nsx nsx nsx 
so (6) reduces to (5). E 


The next lemma is a form of L’H6spital’: 


s rule for increasing piecewise 
linear functions. 


Lemma 2 Let A(x) = ism a(n) and let A,(x) = js A(t) dt. Assume also that 
a(n) > 0 for all n. If we have the asymptotic formula 


(7) A(x) ~ LxX*. asx— oo 
for some c > 0 and L > 0, then we also have 

(8) A(x) ~ cLx*^! 
In other words, formal differentiation of (7) gives a correct result. 


PRoor. The function A(x) is increasing since the a 
Choose any f 1 and consider the difference A, (£x) 


as x > oo. 
(n) are nonnegative. 
— A(x). We have 
px px 
Ai(fx) — A,(x) = f A(u) du > f A(x) du = A(x)(Bx — x) 
= x(B — 1)A(x). 


This gives us 


x46) < 7 — AB») — A409) 


or 


A(x) 1 A,(Bx) A(x) 

aisre ice 

Keep f fixed and let x > œ in this inequality. We find 
A(x) 1 f —1 


lim su. x "—[)- : 
nasg- DL 
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Now let fj — 1+. The quotient on the right is the difference quotient for the 

derivative of x‘ at x = 1 and has the limit c. Therefore 

A(x) 
E 


i — X cL. 
(9) lim sup xit E. 


x-^o 


Now consider any « with 0 « x « 1 and consider the difference 
A,(x) — A,(ax). An argument similar to the above shows that 


1-4 
ma Onr um 
xt0 X 1-4 


As a + 1— the right member tends to cL. This, together with (9) shows that 
A(x)/x* ^ ! tends to the limit cL as x + oo. D» 


When a(n) = A(n) we have A(x) = W(x), 4,69) = Y,(x), and a(n) > 0. 
Therefore we can apply Lemmas 1 and 2 and immediately obtain: 
Theorem 13.1 We have 

(10) vis) = Xx — nAn). 


nsx 


Also, the asymptotic relation (x) ~ x?/2 implies (x) ~ x as x > oo. 


Our next task is to express y, (x)/x? as a contour integral involving the 
zeta function. For this we will require the special cases k =1andk =2 of 
the following lemma on contour integrals. (Compare with Lemma 4 in 


Chapter 11.) 
Lemma 3 If c > 0 and u > 0, then for every integer k > 1 we have 
] fe a all = f0<u<1, 
iz = 4k! 
2ni Ts 2(z + 1)--- (z +k) 


0 ifu>1, 


the integral being absolutely convergent. 
Proor. First we note that the integrand is equal to u "T(z)/T(z + k + 1). 


This follows by repeated use of the functional equation T(z + 1) = zI'(z). 
To prove the lemma we apply Cauchy's residue theorem to the integral 


1 u T(z) 


fate —_—_—_— dz, 
2ni Jer) T(z + k+1) 


where C(R) is the contour shown in Figure 13.1(a) if 0 < u < 1, and that in 
Figure 13.1(b) if u > 1. The radius R of the circle is greater than 2k + c so 
all the poles at z = 0, —1,..., —k lie inside the circle. 
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(a) O<u <1 (b) w>1 
Figure 13.1 


Now we show that the integral along each of the circular arcs tends-to 0 
as R > oo. If z = x + iy and |z| = R the integrand is dominated by 


u`? u^* une 


= < B 
z+ le +k] lzllz * 1--]z - k| ^ Riz H---|z 4 Kk] 


The inequality u^* < u'* follows from the fact that u^* is an increasing 


function of x if0 < u < 1 and a decreasing function ifu > 1. Now if 1 <n<k 
we have 


lz +n|>|z|—n=R -n >R —k > R/2 
since R > 2k. Therefore the integral along each circular arc is dominated by 
2nRu ^ * 
RGR) 


and this 20 as R > œ since k > 1. 

If u > 1 the integrand is analytic inside C(R) hence fer = 0. Letting 
R — oo we find that the lemma is proved in this case. 

If 0 <u < 1 we evaluate the integral around C(R) by Cauchy's residue 
theorem. The integrand has poles at the integers n = 0, —1,..., —k, hence 


= O(R^*) 


1 u` (z) 2 u`T(z) 
2i Jom TG +k +) 7 2. Re TENTE 


E u" e uiy: 
= T(z) = 
Lap um cu almi 
Y &/k (1 — uy 
~ kl Xe AD | 
Letting R — œ we obtain the lemma. a 
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13.3 A contour integral representation 


for y, (x)/x? 
Theorem 13.2 If c > 1 and x > 1 we have 
iO rap EE POPE 
es xi xil... S(s + 5( AO) as. 


Proor. From Equation (10) we have W/,(x)/x = Enex (1 — n/x)A(n). Now 
use Lemma 3 with k = 1 and u = n/x. Ifn < x we obtain 


meo de tet (xin 


x nid oi (stl) - 


Multiplying this relation by A(n) and summing over all n < x we find 


Vi o 1 (Ct Ameum uo y T Fel Amin 5 
ox gee Wt dent sU nat 2ni Je-wi SS + 1) 


since the integral vanishes if n > x. This can be written as 
LJ *oi 
(2) no. 449) ds, 
n=1 Je- coi 


where 2zif,(x) = A(n)(x/nY Ks? + s). Next we wish to interchange the sum and 
integral in (12). For this it suffices to prove that the series 


o * oi 
(13) X f = Laas 


is convergent. (See Theorem 10.26 in [2]) The partial sums of this series 

satisfy the inequality 

x T = ACIDE ee en E adieu 
«i Islls + 1l LEA n Jesi dslls U nai on 


n=1 
where A is a constant, so (13) converges. Hence we can interchange the sum 
and integral in (12) to obtain 
" *oi oo 
ye ["" $ NUN p ES S an 
Am. [S Ef95T misse w^ 


x emi nl 
ore eae 9) 
Ini Ju sS DN OS) 
Now divide by x to obtain (11). B 
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Theorem 13.3 If c > 1 and x > 1 we have 


Wild 1 ee ee 
(14) = AC j = Oi leis L5 
where 
1 Cs) 1 
(15) h(s) Fre 5( i) B= jJ 


Pnoor. This time we use Lemma 3 with k — 2 to get 


1 1 2 1 *oi x 

al(l--) =— ds, 

2 x 2ni J wi s(s + 1)(s + 2) 
where c > 0. Replace s by s — 1 in the integral (keeping c > 1) and subtract 
the result from (11) to obtain Theorem 13.3. o 


If we parameterize the path of integration by writing s = c + it, we find 
x7} = xix" = x° le" 12* and Equation (14) becomes 


1 1 2 £1 c oi t 
(16) ^» i-i) =| Alc + it)e" 18> dt, 


Our next task is to show that the right member of (16) tends to 0 as x > oo. 
As mentioned earlier, we first show that we can put c — 1 in (16). For this 
purpose we need to study ¢(s) in the neighborhood of the line c = i" 


13.4 Upper bounds for |¢(s)| and |¢“(s)| 
near the line o = | 


To study ((s) near the line c = 1 we use the representation obtained from 
Theorem 12.21 which is valid for ¢ > 0, 


N oo m 1-s 
qn t= X n-s EL n 


1 x1 
n= 


ie also use the formula for ¢'(s) obtained by differentiating each member 
of (17), 


N 


(18) i=- Sr, fE [oE z y. 2 = D n 
N N 


Fo» x! x1 


_N*logN — Nic 
s-1 6-1 


The next theorem uses these relations to obtain upper bounds for |¢(s)| 
and |C'(s)|. 
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Theorem 13.4 For every A > 0 there exists a constant M (depending on A) 
suck that 


(19) Mi Mlogt and |č'(s)| € M log? t 


for all s with o > 1/2 satisfying 


(20) i» ge fa 
log t 


Note. The inequalities (20) describe a region of the type shown in Figure 
13.2. 


Figure 13.2 


Proor. If ¢ > 2 we have |{(s)| < (2) and |¢(s)| < |¢"(2)| and the inequalities 
in (19) are trivially satisfied. Therefore we can assume 9 < 2 and t > e. 
We then have 


ls|ao-tx2-4t«2t and|s— 1| 2 t 


so 1/|s — 1| < 1/t. Estimating | £(s)| by using (17) we find 


n xt! t ixl n? 


N 4 o | Nie Ai Be Ner 
Iis Y men dx + D ee 1 
n=1 N 


Now we make N depend on t by taking N = [t]. Then N < t < N + 1 and 
log n < log t ifn < N. The inequality (20) implies 1 — & < A/log t so 


A ni l actes c les lonir cl ga _ of! 
n n n n] 
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Therefore 
2t N+ N2=2 NOI 1 
= =0 O(1 
NS Z O(1) and : A N (1) 
so 


N è 
I9 = (x ;j + O(1) = O(log N) + O(1) = Oleg t). 
n=1 


This proves the inequality for |{(s)| in (19). To obtain the inequality for 
\¢'(s)| we apply the same type of argument to (18). The only essential difference 
is that an extra factor log N appears on the right. But log N = O(log f) so we 
get |C'(s)| = O(log? t) in the specified region. 


13.5 The nonvanishing of £(s) on the 
line o = 1 


In this section we prove that ¢(1 + it) # 0 for every real t. The proof is based 
on an inequality which will also be needed in the next section. 
Theorem 13.5 If c > 1 we have 


(21) C(c)| + it)|*\C(o + 2it)| > 1. 


Proor. We recall the identity /(s) = e69 proved in Section 11.9, Example 1, 
where 


Aln) . 2. il 
G(s) = —— = 
9 A logn" DOM mp™ G= 


This can be written as 


to) = earl È -l i <=. 


p m-i 
from which we find 


ls) = aply y en). 


p m=1 


We apply this formula repeatedly with s = c, s =o + it and s =o + 2it, 
and obtain 


(s) + inre + 2it)| 


A aply $ Y 3 + 4 cos(mt log p) + cos(2mt log p. 


e mp"* 
But we have the trigonometric inequality 


3 + 4cosÓ + cos 20 > 0 
gog 


13.6: Inequalities for |1/C(s)| and !£’(s)/C(s)| 


which follows from the identity 
3 + 4 cos 6 + cos 20 = 3 + 4 cos 0 + 2 cos? 0 — 1 = 2(1 + cos 0)". 

Therefore each term in the last infinite series is nonnegative so we obtain 

(21). oO 

Theorem 13.6 We have ((1 + it) # 0 for every real t. 

Pnoor. We need only consider t # 0. Rewrite (21) in the form 

Uo + it) |t 
c—1 


|C(o + 2it)| ve 
c—1 


(22) {(o — Dto)? 


This is valid if ¢ > 1. Now let ø > 1+ in (22). The first factor approaches 
1 since č(s) has residue 1 at the poles = 1. The third factor tends to |€(1 + 2it)]. 
If (1 + it) were equal to 0 the middle factor could be written as 


(c + it) — Q1 + it) 
1 


c= 


4 
> |C(1+ i) aso>1+. 


Therefore, if for some t # 0 we had (1 + it) = 0 the left member of (22) 
would approach the limit |¢'(1 + it)|*|C(1 + 2it)| as c > 1+. But the right 
member tends to co as ø + 1+ and this gives a contradiction. D 


13.6 Inequalities for |1/(s)| and | (9/G)] 
Now we apply Theorem 13.5 once more to obtain the following inequalities 


for | 1/¢(s)| and |C’(s)/E(s)|- 


Theorem 13.7 There is a constant M > 0 such that 


UG) 


9 
Zs) < M log’ t 


<Mlog’t and 


Y 
Cs) 
whenever a > 1 and t > e. 


Pnoor. For c > 2 we have 


1| [$2492 y Legg 
| 7 | 2 n zr i 
and 
| S AM 
uj 25 yc 


so the inequalities hold trivially if « > 2. Suppose, then, that 1 < c < 2 and 
t > e. Rewrite inequality (21) as follows: 


1 3/4 2it)|!^. 
iic ii < (oy [Gc + 2it)] 
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Now (c — 1)/(o) is bounded in the interval 1 < o < 2, say (c — 1)¢(c) € M, 
where M is an absolute constant. Then 


ifl<oa<2. 


io) < 


e — 


Also, ((c + 2it) = O(log t) if 1 < ø < 2 (by Theorem 134), so for 1 «o <2 
we have n 


1 M M?^"(log t)"^ — A(log t)!/* 
Mc ri)  (c—1?^ -1 


where A is an absolute constant. Therefore for some constant B > 0 we have 


B(c — 1)°/* 


(23) IG + i) > eor gu ifl<o<2,andt>e. 


This also holds trivially for c = 1. Let æ be any number satisfying 1 < « < 2. 
Then if 1 < o < q, t 2 e, we may use Theorem 13.4 to write 


Ilo + it) — Cx + it) < [ie + it)| du € (x — e)M log? t 


< (x — 1)M log? t. 
Hence, by the triangle inequality, 


I&c + it)| = [Cu + it)| — Ho + it) — C + it)| 


> Ua + it)| — (x — 1)M log? t > Bud (« — 1)M log? t 

(log t)/* $ 

This holds if 1 < ø < o, and by(23)it also holdsfora < ø < 2since(c — 1)? 
Z (x — 1)?^. In other words, if 1 < « < 2 and t > e we have the inequality 


Bia — 1)3/4 


Ic + it)| > log ~ (œ — 1)M log? t 


for any a satisfying 1 < a < 2. Now we make « depend on t and choose a 
so the first term on the right is twice the second. This requires 


a=1+ B a 
a 2M) (log 1)?" 


Clearly «> 1 and also æ < 2 if t > tọ for some to. Thus, if t > to and 
1 < o < 2 we have 


; C 

Uc + it)| > (x — 1)M log? t 2 ——. 

Is + iD > (æ — DM log t = c, 

The inequality also holds with (perhaps) a different C if e < t < t. 
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This proves that |C(s)| > C log"? t for all ø > 1, t > e, giving us a corre- 
sponding upper bound for |1/¢(s)|. To get the inequality for |Z'(s/C(s)| we 
apply Theorem 13.4 and obtain an extra factor log? t. D 


13.7 Completion of the proof of the prime 

number theorem 
Now we are almost ready to complete the proof of the prime number theorem. 
We need one more fact from complex function theory which we state as a 
lemma. 


Lemma 4 /f f(s) has a pole of order k at s = a then the quotient f'(s)/f(s) has a 
first order pole at s = & with residue —k. 
Pnoor. We have f(s) = g(s)/(s — o)", where g is analytic at « and g(x) # 0. 


Hence for all s in a neighborhood of « we have 


g'(s) kgs | — 99 [= 4291. 


To= (s — a} "(s-d* (-a'ls-« g(s) 
Thus 
£6) _ -k e 
f(s) s-a gls) 
This proves the lemma since g'(s)/g(s) is analytic at a. fm] 


Theorem 13.8 The function 
g(s) 1 
Si Ri Users —1 

is analytic at s — 1. 
Pnoor. By Lemma 4, —£'(s)/f(s) has a first order pole at 1 with residue 1, 
as does 1/(s — 1). Hence their difference is analytic at s — 1. E] 


Theorem 13.9 For x > 1 we have 
2 o z 
Vie) HC t, 5) x zÍ h(1 + ie" oe dt, 

x 2 x 22 J-a 
where the integral |2,. |h(1 + it)| dt converges. Therefore, by the Riemann- 
Lebesgue lemma we have 
(24) y(x) ~ x72 
and hence 


W(x) ~ x asx œ. 
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Proor. In Theorem 13.3 we proved that if c > 1 and x > 1 we have 


We) i -Sy-z osea. 


x 2 x 2nl J.—ot 


where 


1 (s) 1 ) 
Lp e ( Qs) s-l1/ 
Our first task is to show that we can move the path of integration to the line 


c = 1. To do this we apply Cauchy's theorem to the rectangle R shown in 
Figure 13.3. The integral of x^^ '!h(s) around R is 0 since the integrand 


^n 1T c*iT 


— 0 


1-iT c-iT 
Figure 13.3 


is analytic inside and on R. Now we show that the integrals along the 
horizontal segments tend to 0 as T > oo. Since the integrand has the same 
absolute value at conjugate points, it suffices to consider only the upper 
segment, t = T. On this segment we have the estimates 

1 


1 1 
ss + ne po and Cee HD 


Also, there is a constant M such that 1C G)/Gs)| < M log? t if c > 1 and 
t > e. Hence if T > e we have 


(ien 
Suy 


M log? T 
Ii] « —3— 


so that 


ies 1h(s) ds 
1 


M log? log? 
s [e 2 T qe - My ao 1). 
1 T F 
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Therefore the integra!s along the horizontal segments tend to 0 as T > oo, 
and hence we have 


c oi 3 toi 
f x !h(s) ds = i x th(s) ds. 
c- ci 1-ci 
On the line c = 1 we write s = 1 + it to obtain 
1 itoi a eed 
ye Í x th(s) ds = xl h(i + it)e? 8* qr. 
2ni J1- si mLa 


Now we note that 


[na ema f +f +f , 


In the integral from e to oo we have 


M log? t 
MERNE T 


so f? |h(1 + it)| dt converges. Similarly, fZ% converges, so fe. IAC + it)| dt 
converges. Thus we may apply the Riemann-Lebesgue lemma to obtain 
V, (x) ~ x^/2. By Theorem 13.1 this implies y(x) ~ x as x > oo, and this 
completes the proof of the prime number theorem. o 


13.8 Zero-free regions for ¢(s) 


log? t which we proved in Theorem 13.7 for 
ded to the left of the line c = 1. The estimate is 
rip but rather in a region somewhat like that 
e left boundary curve approaches the line 
The inequality implies the nonvanishing of 


The inequality |1/£(s)] < M 
c > landt > e can be exten 
not obtained in a vertical st 
shown in Figure 13.2 where th 
o = 1 asymptotically as t > ©. 
((s) in this region. More precisely, we have: 


Theorem 13.10 Assume o > 1/2. Then there exist constants A > 0 and C > 0 


such that 
it)| > Ge 
[Ao hd log! t 
whenever 
(25) los EYES and t > e. 
og 


This implies that (a + it) # 0 ifc and t satisfy (25). 
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Pnoor. The triangle inequality, used in conjunction with Theorem 13.7, 
gives us 


(26) \Co + it)| = 1E + it)| — [Z0 + it) — Go + in) 
B H P 
= Tam ICO + it) — Uo + it)|, 
for some B > 0. To estimate the last term we write 


IE + it) — e in| = 


few + it) du 


geri 
< | |C'(u + it)| du. 


Sincet > e we have log? t > log t so 1 — (A/log? t) > 1 — (A/log t). Thus, if 
c satisfies (25) for any A > 0 we can apply Theorem 13.4 to estimate 
|C(u + it)|, giving us 
A MA 
: : p 2 2 E 
\C(1 + it) — člo + it) < M(1 — o)log? t < M log? t foe? T og r 
Using this in (26) we find 
; B — MA 
lelo + it)| > xs 

This holds for some B > 0, any A > 0 and some M > 0 depending on A. A 
value of M that works for some A also works for every smaller A. Therefore 
we can choose A small enough so that B — MA > 0, If we let C = B — MA 


the last inequality becomes |{(o + it)| > C log"? t which proves the theorem 
for all o and t satisfying 


A 
———32« 0 1 and t > e. 
log? t 


But the result also holds for c = 1 by Theorem 13.7 so the proof is complete. 
E 
We know that ¢(s) 4 0 if o > 1, and the functional equation 


(s) = 2(2n)' FU — osin( ya - sg 


shows that ¢(s) # 0 if o < 0 except for the zeros at s = —2, —4, —6,... 
which arise from the vanishing of sin(z5/2). These are called the “trivial” 


zeros of ¢(s). The next theorem shows that, aside from the trivial zeros, 
t(s) has no further zeros on the real axis. 


Theorem 13.11 Jf o > 0 we have 


v (. 1yn-1 
(27) a 2-9) = y ( 1y l 
n=1 


n 
This implies that ((s) < 0 ifs is real and 0 < s < 1. 
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Pnoor. First assume that c > 1. Then we have 


ea el 
1 — 2157508) = =-2) = 
( X9) X n 2 (2nY 
—2(142754374--)—2070-47-67--) 
em-254:3-755—4-5r:5—6 I 


which proves (27) for ¢ > 1. However, if c > 0 the series on the right con- 
verges, so (27) also holds for c > 0 by analytic continuation. 

When s is real the series in (27) is an alternating series with a positive sum. 
If 0 < s < 1 the factor (1 — 217°) is negative hence ¢(s) is also negative. o 


13.9 The Riemann hypothesis 


In his famous 8-page memoir on z(x) published in 1859, Riemann [58] 
stated that it seems likely that the nontrivial zeros of ¢(s) all lie on the line 
o = 1/2, although he could not prove this. The assertion that all the non- 
trivial zeros have real part 1/2 is now called the Riemann hypothesis. In 1900 
Hilbert listed the problem of proving or disproving the Riemann hypothesis 
as one of the most important problems confronting twentieth century 
mathematicians. To this day it remains unsolved. 

The Riemann hypothesis has attracted the attention of many eminent 
mathematicians and a great deal has been discovered about the distribution 
of the zeros of ¢(s). The functional equation shows that all the nontrivial 
zeros (if any exist) must lie in the strip 0 < o < 1, the so-called *critical 
Strip." It is easy to show that the zeros are symmetrically located about the 
real axis and about the “critical line" c = 1/2. 

In 1915 Hardy proved that an infinite number of zeros are located on the 
critical line. In 1921 Hardy and Littlewood showed that the number of zeros 
on the line segment joining 1/2 to (1/2) iT is at least AT for some positive 
constant A, if T is sufficiently large. In 1942 Selberg improved this by showing 
that the number is at least AT log T for some A > 0. It is also known that 
the number in the critical strip with 0 < t < T is asymptotic to T log T/2z 
as T — co, so Selberg’s result shows that a positive fraction of the zeros lie 
on the critical line. Recently (1974) Levinson showed that this fraction is 
at least 7/10. That is, the constant in Selberg's theorem satisfies A > 7/20n. 

Extensive calculations by Gram, Backlund, Lehmer, Haselgrove, Rosser, 
Yohe, Schoenfeld, and others have shown that the first three-and-a-half 
million zeros above the real axis are on the critical line. In spite of all this 
evidence in favor of the Riemann hypothesis, the calculations also reveal 
Certain phenomena which suggest that counterexamples to the Riemann 
hypothesis might very well exist. For a fascinating account of the story of 
large-scale calculations concerning t(s) the reader should consult [17]. 
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13.10 Application to the divisor function 


The prime number theorem can sometimes be used to estimate the order of 
magnitude of multiplicative arithmetical functions. In this section we use it to 
derive inequalities for d(n), the number of divisors of n. 


In Chapter 3 we proved that the average order of d(n) is log n. When n is 
prime we have d(n) = 2 so the growth of d(n) is most pronounced when n 
has many divisors. Suppose n is the product of all the primes <x, say 


(28) n-22-3.5-- pa. 
Since d(n) is multiplicative we have 
d(n) = d(2)d(3) - -- dp.) = 279, 
For large x, n(x) is approximately x/log x and (28) implies that 


logn = Ylogp-—9(x) ex 
psx 
so 2"? is approximately 2 !°8”/ log logn, Now 


24 logn — e logn log2 =n" log2 


hence 2 le£^/ log logn — „ log 2/ log logn 


- In other words, when n is of the form (28) 
then d(n) is approximately 2 loe"! log logn _ „ log 2/ log logn, 


By pursuing this idea with a little more care we obtain the following 
inequalities for d(n). 
Theorem 13.12 Let £ > 0 be given. Then we have: 
(a) There exists an integer N(c) such that n > N(e) implies 
d(n) < 20 *9 logn/ log logn — | (15) log 2/ log logn, 
(b) For infinitely many n we have 


d(n) > 20-9 logn/ log logn (1 ~e) log 2/ log logn, 


Note. These inequalities are equivalent to the relation 


3 log d(i)log log n 
lim sup —— —— S = Jog 2, 

pst logn 5 
Pnoor. Write n = p,“ ..- p, so that d(n) = [Ti (a; + 1). We split the 
product into two parts, separating those prime divisors < f(n) from those 
= f(n), where f (n) will be specified later. Then d(n) = P,(n)P(n) where 


Pi(n)= [[ @+1) and Pn) = IH (+). 


Pi< f(n) pi2 fin) 
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In the product P;(n) we use the inequality (a + 1) < 2* to obtain P5(n) < 
259 where 


Sm= a. 
pis fen 
Now 
n= [pz I pt = ID seo feo. 
[ESI Piz f(n) Piz fin) 
hence 


_logn_ 
log n > S(n)log f(n) or S(n) < RICE 

This gives us 

en P,(n) 2 ^e" log fin) 


To estimate P,(n) we write 
P,(n) = exp Y, log(a; + o} 
Pi< f(n) 
and show that log(a; + 1) < 2 log log n if n is sufficiently large. In fact, 


we have 
n> p 22“ 


hence 
log n > a; log 2, or a; < log n/log 2. 
Therefore 


Last + ees < logn ifn >n 


for some n,. Thus n >n, implies log(1 + a) < log(log n)? = 2 log log n. 


This gives us 


P,(n) < exp? loglogn Y i} < exp{2 log log nz(f(n))}. 
pi<S(n) 

Using the inequality (x) < 6x/log x (see Theorem 4.6) we obtain 

‘ 12f (n)log log 2 = 2¢F(n) log log n] log f(n) 

30) P,(n) < epf log TC) 

where c = 12/log 2. Combining (29) and (30) we obtain d(n) = P,(n)P2(n) < 


a(n) 
pe i , efi) 1og loan 


logn + cf (mlog logn _ logn logn 
gn) =e ft) — loglogn Tog f(r) 
log log n 
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Now we choose f(n) to make f(n)log log n/logn— 0 and also to make 
log f(n)/log log n > 1 as n > œ. For this it suffices to take 


logn 
Ja) = (log log n ` 
Then 
logn 1+ o(1) logn 1 log n 
= = + o(1)) < (1 + € 
a(n) loglogn 1 + o(1) log TA em et ) 


log log n 


if n > N(e) for some N(c). This proves part (a). 

To prove part (b) we pick a set of integers n with a large number of prime 
factors. In fact, we take n to be the product of all the primes « x. Then 
n = œ if and only if x ^ oo. For such n we have, by the prime number 
theorem, 

d(n) = 279 = 20 * ox] logx 


Also for such n we have 


logn = J, log p = 9(x) = x(1 + o(1) 
PSx 
so 


logn e 


"Te a (1 + o(1))log n 


hence 


log x = log log n + log(1 + o(1)) = log log n| 1 + legt + o(1)) 
log log n 
= (1 + o(1))log log n. 
Therefore x/log x = (1 + o(1))log n/log log n and 


d(n) = 20 +o(1)) logn/ log logn 


for such n. But 1 + o(1) > 1 — eifn > N(e) for some N(e), and this proves (b). 


Note. As a corollary of Theorem 13.12 we obtain the relation 


(31) d(n) = o(n?) 


for every ô > 0. This result can also be derived without the use of the prime 
number theorem. (See Exercise 13.13.) 
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13.11 Application to Euler’s totient 


The type of argument used in the foregoing section can also be used to 
obtain inequalities for (n). When n is prime we have y(n) = n — 1. When n 
has a large number of prime factors (n) will be much smaller. In fact, if n 
is the product of all primes <x we have 


1 
em =a (1-3) 


The next theorem gives the asymptotic behavior of this product for large x. 


Theorem 13.13 There is a positive constant c such that, for x > 2, 


1 c 1 
5 Mis) ae ola): 


psx 


Note. It can be shown that c = e^, where C is Euler’s constant. (See [31].) 
Proor. Let P(x) denote the product in (32). Then log P(x) = Ys. log(1 — 1/p). 
To estimate this sum we use the power series expansion 
—log(1 e e ee m Ee (It] < 1) 
£ 2*8 n 


with t = 1/p. Transposing one term we find, with a, = —log(1 — 1/p) — 1/p, 
1 1 1/51 1 ) 1 
wae ap aire el pe obe Sa 
Oca itu tL mn 
This inequality shows that the infinite series 


: 1 1 
-yl-tg1-2]-- 
(33) L a, L | zi 5) 2 
converges, since it is dominated by Y; I/n(n — 1). If B denotes the sum of 


the series in (33) we have 


1 1 il 1 
0<B- yap Lez 2 n(n — 1) = zE- 


psx nzx 


Hence 


XYa-2B-* of) 


psx 


or 
log P(x) lee) 
i ERIS xj: 
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But by Theorem 4.12 the sum on the right is log log x + A + O(1/log x) so 


1 
log P(x) = —log log x — B — A + (sex): 
Therefore 


P(x) = exp(log P(x)} = e73-4e-~ log togx,0(1/ log x), 


Now let c = e7^-^ and use the inequality e" = 1 + Olu) for 0 < u < 1 to 


obtain 
[2 1 c 1 
BG) log x fı ja es 3 ~ logx e es j 


This completes the proof. 


Oo 
Theorem 13.14 Let c be the constant of Theorem 13.13, and let £ > 0 be given. 
(a) There exists an N(e) such that 


cn 
oln) > (1— icm for all n > N(e). 


(b) For infinitely many n we have 


gin) x (1 - ) —— 
log log n 
In other words, 
lim inf 


no 


o(n)log log n my 
" : 


Proor. We prove part (b) first. Take n = Is. p. Then 


gln) _ 1i g 1 
mos ju: a ~ Togx * O(a j 


But log n = 9(x) = (1 + o(1))x, so log log n = (1 + o(1))log x, hence 
9(n) c(l + o(1)) 1 c(1 4- o(1)) c 
Emm +0 

n log log n (log log n)? log log n SAER log log n 

ifn > N(e) for some N(e). This proves (b). ) 

To prove (a) take any n > 1 and write 


where 


1 
P,(n) = Il ( -j and P,(n)= J] (1-5). 


pin 
PS logn l 
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Then 

34 P ja fa 

sie - uda bd .— 
os 


where f (n) is the number of primes which divide n and exceed log n. Since 


nz]l»» J] pz (log n)? 
EU ko 


we find logn > f(n)log log n, so f(n) < log n/log log n. Since 1 — (1/log n) 
« 1, inequality (34) gives us 


4 ] \ log% log logn | \ o8) 1/ log logn 
35) P ——— =) (= 
e» Pa) = ( log ;) ( log ; | 


Now (1 — (1/u))* ^ e^! as u oo so the last member in (35) tends to 1 as 
n > œ. Hence (35) gives us 


P3(n)> 1-4 o(1) asn- œ. 


Therefore 
1 
90) _ p (n)P4(n) > (1 + oft) T ( F D) z (0) II ( < 3 
n pin D. ps logn P 
p< logn 
c c 
=(1+ oll) op Toga” + o) > 0-9 log log n 


if n > N(c). This proves part (a). 


13.12. Extension of Pólya's inequality for 
character sums 


We conclude this chapter by extending Pólya's inequality (Theorem 8.21) to 
arbitrary nonprincipal characters. The proof makes use of the estimate for 
the divisor function, 


d(n) = O(n’) 
obtained in (31). 


Theorem 13.15 If y is any nonprincipal character mod k, then for all x > 2 


we have 


Y. am) = O(/k log k). 


msx 
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Pnoor. If y is primitive, Theorem 8.21 shows that 


Y. xm) < Jk log k. 


msx 


Now consider any nonprincipal character y mod k and let c denote the 
conductor of y. Then c|k, c < k, and we can write 


xm) = V(m)y (m) 


where x, is the principal character mod k and V is a primitive character 
mod c. Then 


Yxm- Y m= Yum Y, ud = È 2 uam) 
msx sx msx d|(m, k) msx E 


m 
(m,k)=1 


= Yuld) Y, Wad) = X aya) Y, va). 
djk qsxjd dk qSxld 


Hence 


(36) 


Xx ES Y laan | » vo) < V/c log cy. |u(d)(d)| 
msx d|k [EET dik 


because is primitive mod c. In the 


last sum each factor |u(d)y(d)| is either 
O or 1. If [i(d)U(d)| = 1 then |u(d)] 


= 1 sod is a squarefree divisor of k, say 


d = pypa ++: p,- 
Also, |y(d)| = 


1 so (d, c) = 1, which means no prime factor p; divides c. 
Hence each p; 


divides k/c so d divides k/c. In other words, 


YIuayd) s Y1= a(*) E (5) 
lk dlkjc c e 


for every ô > 0. In particular, d(k/c) = Oklo) so (36) implies 
Yam) = of E Jc log e) = O(/k log c) = O(/k log k). I 
msx 


Exercises for Chapter 13 


1. Chebyshev proved that if V(x)/x tends to a limit as x — co then this limit equals 1. 
A proof was outlined in Exercise 4.26. This exercise outlines another proof based 
on the identity 


(37) es Er (c > 1) 

given in Exercise 11.1(d). 

(a) Prove that (1 — S)C(s)/E(s) 
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(b) Let 6 = lim sup((x)/x). Given € > 0, choose N = N(e) so that x > N implies 


W(x) < (6 + £)x. Keep s real, 1 < s < 2, split the integral in (37) into two parts, 
f + f$, and estimate each part to obtain the inequality 


_ os) s(5 + £) 
Qs) < C(e) + NS 


, 


where C(e) is a constant independent ‘of s. Use (a) to deduce that 5 > 1. 
(c) Let y = lim inf(y(x)/x) and use a similar argument to deduce that y < 1. 


xo 


Therefore if J(x)/x tends to a limit as x — oo then y = ô= l. 
2. Let A(x) = Pn <x a(n), where 


0 ifn a prime power, 


a(n) = 


l= 


ifn = p*. 


Prove that A(x) = n(x) + Ox log log x). 


3. (a) If c > 1 and x # integer, prove that if x > 1, 


L [tw coy ds = nt) n2) n) 


2ni Jeet 
(b) Show that the prime number theorem is equivalent to the asymptotic relation 
*oi P x 
oie log ((s) = ds ~ —— as x > oo. 
Ti Je oi s logx 
A proof of the prime number theorem based on this relation was given by Landau 
in 1903. 


4. Let M(x) = Yasx H(n). The exact order of magnitude of M(x) for large x is not 
known. In Chapter 4 it was shown that the prime number theorem is equivalent to 
the relation M(x) = o(x) as x > co. This exercise relates the order of magnitude of 


M(x) with the Riemann hypothesis. 
Suppose there is a positive constant 0 such that 


M(x) = O(x®) for x > 1. 
Prove that the formula 


——dx, 


1. [PMO 
1 xut 1 k 

xercise 11.1(c)) would also be valid for ¢ > 6. Deduce 

this shows that the relation M(x) = O(x!? **) 

hypothesis. It can also be shown that the 

1/2*®) forevery £ > 0.(See Titchmarsh [69], 


which holds for ø > 1 (see E 
that ¢(s) # 0 for o > 6. In particular, 
for every e > 0 implies the Riemann 
Riemann hypothesis implies M(x) = O(x 
p. 315.) 
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5. Prove the following lemma, which is similar to Lemma 2. Let 


A(x) = puo. 
0 R 


where A(u) is a nonnegative increasing function for u > 1. If we have the asymptotic 
formula 


A(x) ~ Lx‘ as x œ, 
for some c > 0 and L > 0, then we also have 


A(x) ~ cLx* asx oo. 
6. Prove that 


1 2+ai s 
zi] Yds=0 if0<y<1, 
Tu y 


2-«i S 


What is the value of this integral if yz1? 


1 2+oi xt Cs) : 
UE al- EE 


as a finite sum involving A(n). 


7. Express 


8. Let y be any Dirichlet character mod k with y, the principal character. Define 


L " L 
F(a, t) = 3— (e, +4— it, = it, y?). 
(a, t) L' x) T+ ix) T. + 2it, y?) 


If c > 1 prove that F(ø, t) has real part equal to 


^ 
= Y a Re{3x,(n) + 4y(n)n-" + P(n)n-24 


and deduce that Re F(a, t) x 0. 


9. Assume that L(s, 


x) has a zero of order m > 1 at s = 1 + it. Prove that for this t 
we have: 


D 


L r m 
(a) per it, x) 2 — 


+ O(1) asc 1+, 
a-1 


and 


(b) there exists an integer r > 0 such that 
LI 


L 
— (c + 2it, y?) LP ERE O(l) asc 14, 
L o-1 

except when x? = y, and t = 0. 


10. Use Exercises 8 and 9 to prove that 


L( ity) #0 forallrealtify? + x 
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and that 
L(1 + it, 3) #0 forall real t Oif y? = y,. 
[Hint: Consider F(a, t) as c > 1+.] 
11. Foranyarithmetical functions (n), prove that the following statements are equivalent: 


(a) f(n) = O(n’) for every € > O and all n > n,. 
(b) f(n) = o(n’) for every ô > 0 as n > co. 


12. Let f (n) be a multiplicative function such that if p is prime then 
f(p")-—0 asp" co. 


That is, for every € > 0 there is an N(z) such that | f(p")| < € whenever p" > N(e). 
Prove that f (n) + 0 as n — co. 

[Hint: There is a constant A > 0 such that | f(p")| < A for all primes p and all 
m > 0, and a constant B > 0 such that | f(p”)| < 1 whenever p" > B.] 


13. If a > 0 let o,(n) = Yj, d". Prove that for every ô > 0 we have 
c,(n) = o(n***) asn— œ. 


[Hint: Use Exercise 12.] 
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14.1 Introduction 


A = (25,45, 25 


where the elements a; are special numbers such as primes, squares, cubes, 
triangular numbers, etc. Each representation of n as a sum of elements of A 
is called a partition of n and we are interested in the arithmetical function 
A(n) which counts the number of partitions of n into summands taken from A. 
We illustrate with some famous examples. 


Goldbach conjecture Every even n » 4 is the sum of two odd primes. 


In this example A(n) is the number of solutions of the equation 
(1) n—p,- pz, 


where the D; are odd primes. Goldbach's assertion is that A(n) > 1 for even 
n > 4. This conjecture dates back to 1742 and is undecided to this date. 
In 1937 the Russian mathematician Vinogradov proved that every sufficiently 
large odd number is the sum of three odd primes. In 1966 the Chinese 
mathematician Chen Jing-run proved that every sufficiently large even 


number is the sum of a prime plus a number with no more than two prime 
factors. (See [10].) 
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Representation by squares For a given integer k > 2 consider the partition 
function r,(n) which counts the number of solutions of the equation 
(2) n=x te + x2, 


where the x; may be positive, negative or zero, and the order of summands 
is taken into account. 


For k = 2, 4, 6, or 8, Jacobi [34] expressed r,(n) in terms of divisor func- 
tions. For example, he proved that 


r2(n) = 4{d,(n) — d3(n)}, 


where d,(n) and d3(n) are the number of divisors of n congruent to 1 and 3 
mod 4, respectively. Thus, r;(5) = 8 because both divisors, 1 and 5, are 
congruent to 1 mod 4. In fact there are four representations given by 


522 41? =(-2 +1? =(-2? + (1h = 2 + (21), 


and four more with the order of summands reversed. 
For k = 4 Jacobi proved that 


ra(n) = } d = 8e(n) if nis odd, 
din 


= 24) d ifnis even. 
dasa 
The formulas for re(n) and rg(n) are a bit more complicated but of the same 


general type. (See [14].) 
Exact formulas for r,(n) have also been found for k = 3, 5, or 7; they 


involve Jacobi’s extension of Legendre’s symbol for quadratic residues. 
For example, if n is odd it is known that 


r3(n) = 24 Y, (m|n) ifn =1 (mod 4) 
msn/4 


=8 Y (m|n ifn =3 (mod 4), 


msn[2 


where now the numbers x;, x2, xs in (2) are taken to be relatively prime. 

For larger values of k the analysis of r,(n) is considerably more complicated. 
There is a large literature on the subject with contributions by Mordell, 
Hardy, Littlewood, Ramanujan, and many others. For k > 5it is known that B 
7,(n) can be expressed by an asymptotic formula of the form 


(3) rn) = p.n) + Rin), 
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where p,(n) is the, principal term, given by the infinite series 


wln? e q (D) - m 


pin) = un c a q 
r 5 (h.g=1 


and R,(n) is a remainder term of smaller order. The series for p,(n) is called 
the singular series and the numbers G(h; q) are quadratic Gauss sums, 


q 
Gh; 4) = X emih, 
r=1 


In 1917 Mordell noted that r,(n) is the coefficient of x" in the power series 
expansion of the kth power of the series 


The function 9 is related to elliptic modular functions which play an im- 
portant role in the derivation of (3). 


Waring's problem To determine whether, for a given positive integer k, there 
is an integer s (depending only on k) such that the equation 


(4) n2 xj xe xl 


has solutions for every n 2 1. 


The problem is named for the English mathematician E. Waring who 
stated in 1770 (without proof and with limited numerical evidence) that every 
nis the sum of 4 squares, of 9 cubes, of 19 fourth powers, etc. In this example 
the partition function A(n) is the number of solutions of (4), and the problem 
is to decide if there exists an s such that A(n) > 1 for all n. 

If s exists for a given k then there is a least value of s and this is denoted by 
g(k). Lagrange proved the existence of g(2) in 1770 and, during the next 139 
years, the existence of g(k) was shown for k — 3, 4, 5, 6, 7, 8 and 10. In 1909 
Hilbert proved the existence of g(k) for every k by an inductive argument 
but did not determine its numerical value for any k. The exact value of g(k) 
is now known for every k except k — 4. Hardy and Littlewood gave an 
asymptotic formula for the number of solutions of (4) in terms of a singular 


series analogous to that in (3). For a historical account of Waring's problem 
see W. J. Ellison [18]. 


Unrestricted partitions 


One of the most fundamental problems in additive number theory is that 
of unrestricted partitions. The set of summands consists of all positive 
integers, and the partition function to be studied is the number of ways ” 
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can be written as a sum of positive integers « n, that is, the number of 
solutions of 
(5) n=a, +a +- 


The number of summands is unrestricted, repetition is allowed, and the 
order of the summands is not taken into account. The corresponding partition 
function is denoted by p(n) and is called the unrestricted partition function, 
or simply the partition function. The summands are called parts. For example, 
there are exactly five partitions of 4, given by 


4=341=24+2=24+14+1=14+14+141, 
so p(4) = 5. Similarly, p(5) = 7, the partitions of 5 being 
S=4+1=3+2=34+14¢1=24+2+1 2414141 
=1+1+14+1+1. 
The rest of this chapter is devoted to a study of p(n) and related functions. 


14.2 Geometric representation of partitions 


z 
There is a simple way of representing partitions geometrically by using a 
display of lattice points called a graph. For example, the partition of 15 given 
by A 
64+34+34+2+1 


can be represented by 15 lattice points arranged in five rows as follows: 


If we read this graph vertically we get another partition of 15, 
54443414141. 


d to be conjugate. Note that the largest part in 


Two such partitions are sai 
ual to the number of parts in the other. Thus we 


either of these partitions is eq 
have the following theorem. 


Theorem 14.1 The number of partitions of n into m parts is equal to the number of 
Partitions of n into parts, the largest of which is m. 


Several theorems can be proved by simple combinatorial arguments 
involving graphs, and we will return later to a beautiful illustration of this 
method. However, the deepest results in the theory of partitions require 


a more analytical treatment to which we turn now. 
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14.3 Generating functions for partitions 


os 
A function F(s) defined by a Dirichlet series F(s) = Y, f(n)n ^ is called a 
generating function of the coefficients f (n). Dirichlet series are useful generat- 
ing functions in multiplicative number theory because of the relation 

n ^m * = (nm) ^. 

“4 d 

In additive number theory it is more convenient to use generating functions 
represented by power series, 


F(x) = », f(x" 


because x"x" = x"*". The next theorem exhibits a generating function for 
the partition function p(n). 


Theorem 14.2 Euler. For |x| « 1 we have 


E 1 co 
Il, Ste a Xp à 


wheré p(0) — 1. 
Pnoor. First we give a formal derivation of this identity, 
of convergence, then we give a more rigorous proof. 


If each factor in the product is expanded into a power series (a geometric 
Series) we get 


LÀ 
1 
Il jg -ütxex 630 €x xt et th Ee. 


ignoring questions 


Now we multiply the series on the right, treating them as though they were 
polynomials, and collect like powers of x to obtain a power series of the form 


1+ Y ats. 
k=1 


We wish to show that a(k) = p(k). Suppose we take the term x*' from the 
first series, the term x?* from the second, the term x?^ from the third,...,and 
the term x”*m from the mth, where each k; 2 0. Their product is 


ahi kayka... mkm yk 


Say, where 
k= k, + 2k; + 3k3 + --- + mk 
This can also be written as follows: 


H 


- 


m* 


All esas Oe SE va oo ee ee ee ae) 
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- 
where the first parenthesis contains k, ones, the second k; twos, and so on. 
This is a partition of k into positive summands. Thus, each partition of k 
will produce one such term x* and, conversely, each term x* comes from a 
corresponding partition of k. Therefore a(k), the coefficient of x^, is equal to 
p(k), the number of partitions of k. 

The foregoing argument is not a rigorous proof because we have ignored 
questions of convergence and we have also multiplied together infinitely 
many geometric series, treating them as though they were polynomials. 
However, it is not difficult to transform the above ideas into a rigorous proof. 

For this purpose we restrict x to lie in theinterval0 < x < 1and introduce 


two functions, 


Fx) = [I 

k= 

The product defining F(x) converges absolutely if 0 < x < 1 because its 

reciprocal [ [(1 — x*) converges absolutely (since the series }, x* converges 

absolutely). Note also that for each fixed x the sequence {F,,(x)} is increasing 
because 


el Pe s ‘i 
— d F(x) = [| —— = lim F9. 
| and Fo) = TT 4— 7 Bim (x) 


2 
Lx k=1 


Fm+1(%) = ot Fal > F(x). * 


Thus F,(x) < F(x) for each fixed x, 0 < x < 1, and every m. Now F,,(x) 
is the product of a finite number of absolutely convergent series. Therefore 
it, too, is an absolutely convergent series which we can write as’ 


F,(x)-21- Y po (K)x*. 
k=1 


Here p,,(k) is the number of solutions of the equation 
k =k, + 2k, + +-+ + mk, 


In other words, p,,(k) is the number of partitions of k into parts not exceeding 
m. If m > k, then p,(k) = p(k). Therefore we always have 


p,(k) < pík) 
with equality when m > k. In other words, we have 
lim p,(k) = p(k). 


m>o 


Now we split the series for F, m(x) into two parts, 


F,(x) = È pall + X pelle 
k= k=m+ 


F phe Y plk. 
k=0 k=m+1 


14: Partitions 


Since x > 0 we have " 


Y plkx* < Felt) < FO). 
k=0 


This shows that the series 5. p(k)x* converges. Moreover, since p,(k) < 
p(k) we have 


5 Pmlk)x* < ¥ plk) < F(x) 
k=0 k=0 


so, for each fixed x, the series Y Pmlk)x* converges uniformly in m. Letting 
m oo we get 


F(x) = lim F,(x) = lim Y p (k)x* = S lim p,(x)x* — y p(k)x*, 
k=0 


m-*o m-o k-0 k=0 mca 


which proves Euler’s identity for 0 < x X 1. We extend it by analytic 
continuation to the unit disk |x] « 1. D 


Table 14.1 Generating functions 


š The number of partitions of n 
Generating function into parts which are 
————— eee 

E] i^ 

I 1 pa 

LJ ji 

Ma er even 

e = 1 

TI R squares 
m=1 um 
IL 1 " 

primes 
» PX 

II a +x" unequal 
m=1 

IIa 47-5 odd and unequal 

(m-1 
(1 + x?7) even and unequal 

m=1 

IIa 4 x7) distinct squares 
m=1 
IIa + x") distinct primes 
P 
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By similar arguments we can readily find the generating functions of many 
other partition functions. We mention a few examples in Table 14.1. 


14.4 Euler’s pentagonal-number theorem 


We consider next the partition function generated by the product [ [(1 — x"), 
the reciprocal of the generating function of p(n). Write 


I (ü—x")21-4 DEOS 
m=1 n=1 


To express a(n) as a partition function we note that every partition of n 
into unequal parts produces a term x” on the right with a coefficient +1 or 
— 1. The coefficient is + 1 if x" is the product of an even number of terms, and 
— 1 otherwise. Therefore, 


a(n) = p(n) — po(n), 


where p,(n) is the number of partitions of n into an even number of unequal 
parts, and p,(n) is the number of partitions into an odd number of unequal 
parts. Euler proved that p,(n) = p,(n) for all n except those belonging to a 


special set called pentagonal numbers. 
The pentagonal numbers 1, 5, 12, 22, ... were mentioned in the Historical 


Introduction. They are related to the pentagons shown in Figure 14.1. 


a Sian 


1 1+4=5 14+44+7=12 1*4*7*10222 


Figure 14.1 


These numbers are also the partial sums of the terms in the arithmetic 
progression 
1, 4, 7, 10, 13,.. ., 3n 1,... 


If on) denotes the sum of the first n terms in this progression then 


3n(n — 1) _ 3 —-n 
2 ——S ~~ oe 


+ 
á 2 


dhe) = 5 Ok +) 
k=0 


The numbers on) and œ(—n) = (3n? + n)/2 are called the pentagonal 
numbers. 
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Theorem 14.3 Euler's pentagonal-number theorem. If |x| < 1 we have 


tla x") = 1 — xX — x? + XP Hx? — x12 — gS ga 
m=1 
o œ% 
ede » (— 1 (x99 4 xem) = y» (— 1)", 
n-1 n--—co0 
Pnoor. First we prove the result for 0 < x < 1 and then extend it to the disk 
|x| < 1 by analytic continuation. Define Po = So = 1 and, for n > 1, let 


P, = [Itt - x) and $, — 1 + Y (—1y (x9 + xe ny 
r=1 r=1 
The infinite product Ta — x”) converges so P, > pla — x") as n> oo. 
We will prove (using a method of Shanks [63]) that 
(6) |S, — P,| € nx"*!, 


Since nx"*! > 0 as n > co this will prove Euler’s identity for 0 < x « 1. 
To prove (6) we let g(r) = r(r + 1)/2 and introduce the sums 


= P 

F,- Y(- 

Eu IE 

We show first that F, is a disguised form of Sa. It is easily verified that 
Fı = S, = 1 — x — x? Therefore, if we show that 


xao 


Fa — Fy-4 = Sn — Sy_1, or F, — Sy = Fy-4 — Spo, | 
this will prove that F, = S, for all n > 1. Now l 


n P, n-1 P. e { 
F,— Fy. = AE e = Dy (—-1y ;: x ot) 
r-0 r r=0 r 


In the first sum we write P, = (1 — x")P,_, and separate the term with r = n. 
Then we distribute the difference 1 — x" to obtain 


n-1 
Fa = Fia = Capto y Cay Pe a 
r=0 


r 


Y Prot tate) "SS Pa- (r) 
EN (-1y BIL x(r*im*gü) - (-1y ni xrn- 1)*g(). 
r=0 P, E P, 

Now combine the first and third sums and note that the term with r = 0 
cancels. In the second sum we shift the index and obtain 


n-1 
BUE. (=1x" +o y X (y Pec geo nee 2 
r-1 r 


= Y cay: Pr- xrrtatr— 1) 
"i P,- 
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But (x' — 1)/P, = —1/P,-, and r(n — 1) + g(r) = rn + g(r — 1) so the last 
two sums cancel term by term except for the term with r = n in the second 
sum. Thus we get 


F, cu pr Rz (=1 t E (—1yx**ee- n 
But 


n? + g(n) =n? + ae —e(—n) andn? + gin — 1) = a(n), 


so 
Fp — F7, = (— DN xe pxsc- my Se Sei, 


and hence F, = S, for all n > 1. In the sum defining F, the first term is P, so 
(7) F,=P, +), (ye eae «0 
r=1 r 


Note that 0 < P,/P, < 1 since 0 < x < 1. Also, each factor x™”+9 < xn*1 
so the sum on the right of (7) is bounded above by nx** !. Therefore |F,, — P,| 
< nx"*! and, since F, = S,, this proves (6) and completes the proof of 
Euler’s identity. o 


14.5 Combinatorial proof of Euler’s 
pentagonal-number theorem 


Euler proved his pentagonal-number theorem by induction in 1750. Later 
proofs were obtained by Legendre in 1830 and Jacobi in 1846. This section 
describes a remarkable combinatorial proof given by F. Franklin [22] in 
1881. 

We have already noted that 


n= 


[1a -x= 14 È tpa) — pana, 


where p,(n) is the number of partitions of n into an even number of unequal 
parts, and p,(n) is the number of partitions into an odd number of unequal 
parts. Franklin used the graphical representation of partitions by lattice 
points to show that there is a one-to-one correspondence between partitions 
of n into an odd and even number of unequal parts, so that p,(n) = poln), 
except when n is a pentagonal number. 

Consider the graph of any partition of n into unequal parts. We say the 
graph is in standard form if the parts are arranged in decreasing order, as 
illustrated by the example in Figure 14.2. The longest line segment connecting 
Points in the last row is called the base of the graph, and the nimber of 
lattice points on the base is denoted by b. Thus, b > 1. The longest 45° 
line segment joining the last point in the first row with other points in the 
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e o ooo o " 
AN ANNA ^«— — Slope (s = 4) 
e. 9 € € 

° o o 


e 
PS Base (^ = 2) 


Figure 14.2 


graph is called the slope, and the number of lattice points on the slope is 
denoted by s. Thus, s > 1. In Figure 14.2 we have b = 2 and s = 4. 

Now we define two operations A and B on this graph. Operation A moves 
the points on the base so that they lie on a line parallel to the slope, as 
indicated in Figure 14.3(a). Operation B moves the points on the slope so that 
they lie on a line parallel to the base, as shown in Figure 14.3(b). We say an 
operation is permissible if it preserves the standard form of the graph, that is, 
if the new graph again has unequal parts arranged in descending order. 


-0-0-0 
(a) Operation A (b) Operation B 


Figure 14.3 


If A is permissible we get a new partition of n into unequal parts, but the 
number of parts is one less than before. If B is permissible we get a new 
partition into unequal parts, but the number of parts is one greater than 
before. Therefore, if for every partition of n exactly one of A or B is per- 
missible there will be a one-to-one correspondence between partitions of n 
into odd and even unequal parts, so p,(n) = p,(n) for such n. 


To determine whether A or B is permissible we consider three cases: 
(1) b < s; (2)b =s;(3)b>s. 


Case 1: If b < s then b < s — 1 so operation A is permissible but B is not 
since B destroys the standard form. (See Figure 14.3.) 

Case 2: If b = s, operation B is not permissible since it results in a new 
graph not in standard form. Operation A is permissible except when the 
base and slope intersect, as shown in Figure 14.4(a), in which case the new 
graph is not in standard form. 

Case 3: If b > s, operation A is not permissible, whereas B is permissible 
except when b = s + 1 and the base and slope intersect, as shown in Figure 
14.4(b). In this case the new graph contains two equal parts. 

Therefore, exactly one of A or B is permissible with the two exceptions 
noted above. Consider the first exceptional case, shown in Figure 14.4(a), 
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(a) b=s (b) b=st+1 
Figure 14.4 Neither A nor B is permissible. 


and suppose there are k rows in the graph. Then b = k also so the number 
nis given by 


3k? — k 
5 = lk). 


n=k + (k+ 1) +--+ 2k- 1)= 


For this partition of n we have an extra partition into even parts if k is even, 
and an extra partition into odd parts if k is odd, so 


pn) — pn) = (— 1%. 


In the other exceptional case, shown in Figure 14.4(b), there is an additiv al 
lattice point in each row so 


3k? — k 3k? +k 
nE D +k= 2 Fal o(—k) 
and again p,(n) — pon) = (— 1). This completes Franklin's proof of s 


identity. 


14.6 Euler's recursion formula for p(n) 


Theorem 14.4 Let p(0) = 1 and define p(n) to be 0 if n < 0. Then for n > 1 
we have 
(8) p(n) — p(n — 1) — pn — 2) + pn = 5) + pn - 7) € = 0, 


or, what amounts to the same thing, 


p) = Y (—DE pln — o) + pn — oX- 1). 
k=1 


Pnoor. Theorems 14.2 and 14.3 give us the identity 
( a Epp” Jr 01 5j pois) ST 
k=1 m=0 


If n > 1 the coefficient of x" on the right is 0 so we immediately obtain (8) 
by equating coefficients. H 
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MacMahon used this recursion formula to compute p(n) up to n = 200. 
Here are some sample values from his table. 


pl) =1 
p(5) =7 
p(10) = 42 
p(15) = 176 
(20) = 627 
p(25) = 1,958 
p(30) = 5,604 
p(40) = 37,338 
p(50) = 204,226 


p(100) = 190,569,292 
p(200) = 3,972,999,029,388 


These examples indicate that p(n) grows very rapidly with n. The largest 
value of p(n) yet computed is p(14,031), a number with 127 digits. D. H. 
Lehmer [42] computed this number to verify a conjecture of Ramanujan 
which asserted that p(14,031) = 0 (mod 11*). The assertion was correct. 
Obviously, the recursion formula in (8) was not used to calculate this value 


of p(n). Instead, Lehmer used an asymptotic formulà of Rademacher [54] 
which implies 


eum 
p(n) ~ —= asn> co, 


4n /3 


where K = 2(2/3)'?. For n = 200 the quantity on the right is approximately 
4 x 10'? which is remarkably close to the actual value of p(200) given in 
MacMahon’s table. 


In the sequel to this volume we give a derivation of Rademacher's 
asymptotic formula for p(n). The proof requires considerable preparation 
from the theory of elliptic modular functions. The next section gives a crude 


upper bound for p(n) which involves the exponential ef? and which can be 
obtained with relatively little effort. 


14.7 An upper bound for p(n) 
Theorem 14,5 If n > 1 we have p(n) < e**7, where K = (2/3)? 
Proor. Let 
Fe) = Td ey =14 Y per, 
n=1 k=1 
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and restrict x to the interval 0 < x < 1. Then we have p(n)x" < F(x), from 
which we obtain log p(n) + n log x < log F(x), or 


(9) log p(n) < log F(x) + n log. 


We estimate the terms log F(x) and n log(1/x) separately. First we write 


log F(x) = —log Tha =k) == P — x)= » X = 
© e um = o ib xn 
git PO ) i NICHE 


m=1 


Since we have 
1-~*x 
1-x 


—-1-xc4xbBeeBx", 


and since 0 < x « 1, we can write 


MN per 
mx « = «m, 
and hence 
m(l—x) 1—x" m(l-») 
= <—n Soon 
Inverting and dividing by m we get 
1! ax" 1. aoo 1x 
< 


mics moe een 
Summing on m we obtain 


e poss Mea TU st ox n? 
log FG) = bia 1-34 n 61-x 6t’ 


where 


Note that t varies from oo to 0 through positive values as x varies from 
Oto 1. 

Next we estimate the term n log(1/x). For t > 0 we have log(1 + t) < t. 
But 


1 ony eee ul tog 
den= a ES Se 
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Now 
n? 
6t 
The minimum of (x?/6t) + nt occurs when the two terms are equal, that is, 
when z?/(6t) = nt, or t = 2/,/6n. For this value of t we have 

log p(n) < 2nt = 2nn/,/6n = K./n 


so p(n) < eK, as asserted. 


1 
(10) log p(n) « log F(x) 4- n log- « — 4 nt. 


E 


Note. J. H. van Lint [48] has shown that with a little more effort we can 
obtain the improved inequality 


Kya 
(11) ne 


———— fi le 
p(n) < a orn > 


Since p(k) > p(n) if k > n, we have, for n > 1, 
Ei E n 
Fox) > Y pk > pin) Y x = P0 
k=n k=n T= 
Taking logarithms we obtain, insteady of (9), the inequality 


1 
log p(n) < log F(x) + n log - + log(1 — x). 


Since 1 — x = tx we have log(1 — x) = log t — log(1/x), hence 
(10) can be replaced by 


2 
(12) log p(n) < = + (n — It + logt. 
An easy calculation with derivatives shows that the function 


n? 
ros ta- 1)t + logt 


has its minimum at 


pa ht V1 + A ZO 
¥ 2(n — 1) i 


Using this value of t in (12) and dropping insignificant terms we obtain (11). 


14.8 Jacobi's triple product identity 


This Section describes a famous identity of Jacobi from the theory of theta 
functions. Euler's pentagonal number theorem and many other partition 
identities occur as special cases of Jacobi’s formula. 
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14.8: Jacobi’s triple product identity 


Theorem 14.6 Jacobi’s triple product identity. For complex x and z with 
|x| < 1 and z # 0 we have 


(13) [9 tee i= rti 
n=1 m--—o 

Pnoor. The restriction |x| « 1 assures absolute convergence of each of the 
products [[(1 — x"), [[(1 + x?77!z?), [](1 + x?77!z72), and of the 
series in (13). Moreover, for each fixed x with |x| « 1 the series and products 
converge uniformly on compact subsets of the z-plane not containing z — 0 
so each member of (13) is an analytic function of z for z # 0. For fixed z # 0 
the series and products also converge uniformly for |x| € r « 1 hence 
represent analytic functions of x in the disk |x| « 1. 

To prove (13) we keep x fixed and define F(z) for z 4 0 by the equation 


(14) F(z) = [[Q + x^ z5)ü + xz?) 
n=1 

First we show that F satisfies the functional equation 

(15) xz?F(xz) = F(z). 

From (14) we find 


F(xz) = IIa 4 x2n*iz2) + x?n73272) 
n=1 


= (1 + x?" 12) IIa + x? 12-2), 
m=2 r=0 
Since xz? = (1 + xz2)/(1 + x^!z^?), multiplication of the last equation 


by xz? gives (15). 
Now let G(z) denote the left member of (13) so that 


(16) G(z) = F(z) ia — x?) 


Then G(z) also satisfies the functional equation (15). Moreover, G(z) is an 
even function of z which is analytic for all z # 0 so it has a Laurent expansion 


of the form 
(17) Gz)= Y az" 
E 


m=- 


where a_„ — a, since G(z) = G(z~'). (The coefficients am depend on x.) 
Using the functional equation (15) in (17) we find that the coefficients satisfy 
the recursion formula 


am = x?" lam; 
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which, when iterated, gives 
am = agx" for allm>0 
since 1+3+---+(2m—1)=m’. This also holds for m <0, Hence 
(17) becomes 
(18) G.(z) = ao(x) X xmgms 


where we havewritten G,(z) for G(z) and do(x) for a to indicate the dependence 


on x. Note that (18) implies a(x) > 1 as x + 0. To complete the proof we 
must show that a(x) = 1 for all x. 


Taking z = e"'/^ in (18) we find 


(19) G,fe*!!*) bi oo 


- xm = Y (pow 
a(x) Ex i P Ws 
since i" = — i^" if mis odd. From (18) we see that the series on the right of 
(19) is G,.(i)/ao(x*) so we have the identity 
(20) CAO _ Gadi) 


ax) aof) 
We show next that G,(e*!*) = G. (i). In fact, (14) and (16) gives us 
G(e™"4) = Ila — x^"n(1 + x*n-2). 
Since every even number is of the form 4n or 4n — 2 we have 
Tla — x”) = Ta — x4")(1 — x*»-2) 


so 


Ge!) I IIa = x*" ER 15-2) T. Kar 3) en Ia A x*"ü - X804) 
n=1 n=1 
= fa = x®")(1 = x9"-5) Z. PELL) = Gdi). 


Hence (20) implies ag(x) = a(x*). Replacing x by x*, x**, ..., we find 


o(x) = a(x) fork = 1,2,... 


But x*" — 0 as k > oo and a(x) > 1 as x> 0 so ao(x) = 1 for all x. This 
completes the proof. 


oO 
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14.9 Consequences of Jacobi’s identity 

If we replace x by x^ and z? by x’ in Jacobi's identity we find 
[à — x? + x75 + x27275 — » am? + bm 
n=1 "mz2-o 


Similarly, if z? = —x^ we find 


(1 — x?" (1 — x2n72*5)(1 — x2272-5) — $ (— 1)nxem tim. 
1 


"172-0 


8 


To obtain Euler's pentagonal number theorem simply take a — 3/2 and 
b — 1/2 in this last identity. 

Jacobi's formula leads to another important formula for the cube of 
Euler's product. 


Theorem 14.7 If |x| « 1 we have 


en — [Ja-x*- X Creme 
n=1 m--—o 
= Y (Om + xt, 
m=0 


Pnoor. Replacing z? by —xz in Jacobi's identity we obtain 
IIa - x?")(1 - x3 — x22) = Y (1) tm — z7™-1), 
n=1 m=0 


Now we rearrange terms on both sides, using the relations 


fla 7275-2 - 5I - x7» 
n=1 n-1 
and 
gm gmt (Paz Yd bz 6 a tes zm" 


Canceling a factor 1 — z~* we obtain 


Ta — x?) — x?"z)(1 — peste) 


n=1 
= puse aa a ee gt) 
Taking z = 1 and replacing x by x”? we obtain (21). n 
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14.10 Logarithmic differentiation of 
generating functions 


Theorem 14.4 gives a recursion formula for p(n). There are other types of 
recursion formulas for arithmetical functions that can be derived by logarith- 
mic differentiation of generating functions. We describe the method in the 
following setting. 

Let A bea given set of positive integers, and let f (n) be a given arithmetical 
function. Assume that the product 


Fy) = []( — xe 


ncA 
and the series 


G(x) = » w x 


converge absolutely for |x| < 1 and represent analytic functions in the unit 
disk |x| < 1. The logarithm of the product is given by 


fin) AWS xm omo 
log F4(x) = — Y ——log(1 — x") = - m 
g F (x) 2 h g( ) p " 2 = P3 m 8 ). 
Differentiating and multiplying by x we obtain 
POL en mio WoW 
xy L= Y Yfee-Y Yea, 
EAC) cma m=1 neA m=1n=1 


where x, is the characteristic function of the set A, 


E 1 ifneA, 
TAY 10. n A, 


Collecting the terms with mn — k we find 


XD tar) fn) = ps Salk), 


where 


fa) = PEZON (d) = 2 fa). 


deA 
Therefore we have the following identity, 


(22) XF (x) = F9) Y, fake 
k=1 
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Now write the product F (x) as a power series, 
F4(x) = Y pa, (nx, where pa, (0) = 1, 
n-0 


and equate coefficients of x^ in (22) to obtain the recursion formula (24) in the 
following theorem. 


Theorem 14.8 For a given set A and a given arithmetical function f, the numbers 
Da, sn) defined by the equation 


Q3 [Ta — 97/9 = 1: Y pa e 
n=1 


ned 


satisfy the recursion formula 


en domi È fip, jn - 8, 


where pa, (0) = 1 and 
fal = Y fa). 
djk 
deA 


EXAMPLE | Let A be the set of all positive integers. Iff(n) = n, then p4, (n) = 
p(n), the unrestricted partition function, and f4(k) = o(k), the sum of the 


divisors of k. Equation (24) becomes 


np(n) = Y, o(k)p(n — k), 


k=1 


a remarkable relation connecting a function of multiplicative number 


theory with one of additive number theory. 


s in Example 1, but let f(n) = —n. Then the coefficients 


ExAMPLE 2 Take A a 
ler’s pentagonal-number theorem and the 


in (23) are determined by Eul 
recursion formula (24) becomes 


i n-1 
Q5) np, jn) = — È pa a= = — 20) — 2 pa dota — I, 
k=1 s. 
where 
fs 1" ifnisa pentagonal number om) or o — m) 


pa, s(n) = 1o if n is not a pentagonal number. 
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Equation (25) can also be written as follows: 
a(n) — a(n — 1) — a(n — 2) + e(n — 5) + o(n — 7) —... 
(—1)""*@(m) ifn = am), 
= (7 10"^7o(—m) ifn = o(—m), 
0 otherwise. 


The sum on the left terminates when the term a(k) has k < 1. To illustrate, 
when n — 6 and n — 7 this gives the relations 

9(6) = o(5) + o(4) — o(1), 

9(7) = o(6) + o(5) — o(2) — 7. 


14.11 The partition identities of Ramanujan 
By examining MacMahon's table of the partition function, Ramanujan was 


led to the discovery of some striking divisibility properties of p(n) For 
example, he proved that 


(26) p(5m + 4) = 0 (mod 5), 
(27) p(7m + 5) = 0 (mod 7), 
(28) p(11m + 6) = 0 (mod 11). 


In connection with these discove: 


ties he also stated without proof two 
remarkable identities, 


S meteo 
Q9) 2pm + den = sO, 
and 
x ws PO) ex? 
(30) PL. "Bop 7 “ow + 49x ey 
where 


G 


(x) = J] — x». 


n=1 


Since the functions on the right of (29) and (30) have power series expansions 
with integer coefficients, Ramanujan’s identities immediately imply the 
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Exercises for Chapter 14 
1. Let A denote a nonempty set of positive integers. 


(a) Prove that the product 


IIa x» 
meA 
is the generating function of the number of partitions of n into parts belonging 
to the set A. 
(b) Describe the partition function generated by the product 
II (1 + x”). 
meA 


In particular, describe the partition function generated by the finite product 
[k 
malt 


2. If |x| < 1 prove that 
E E] 
IIa» ma- xiii 
m=1 m=1 


and deduce that the number of partitions of n into unequal parts is equal to the 
number of partitions of n into odd parts. 


3. For complex x and z with |x| < 1, let 
E 


f(x, 2) = [] (l - x72). 


m=1 


edz the product is an analytic function of x in the disk 


(a) Prove that for each fix t 
| < 1 the product is an entire function 


|x| < 1, and that for each fixed x with |x 


ofz. 
(b) Define the numbers a,(x) by the equation 


fœ z= » a,(x)z". 
n=0 


Show that f(x, z) = (1 — xz) fC zx) and use this to prove that the coefficients 
satisfy the recursion formula 
ax) = a (x)X^ — an- 109x". 


(c) From part (b) deduce that a,(x) = (— 1yxe* 9? JP (x), where 


P,(x) = [It - x. 
r=i 
identity for |x| < 1 and arbitrary z: 


Ia -x2- "i Cem 


This proves the following 


m=1 
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4. Use a method analogous to that of Exercise 3 to prove that if Ix| < Land |z|< 1 


we have 

T e L4 z" 
1—x")'= 
II ) x P,(x) 

where P,(x) = []?-, (1 — x’). 

5. If x # 1 let Q(x) = 1 and for n > 1 define 
= 1— xr 
Q,(x) = E lox 


(a) Derive the following finite identities of Shanks: 


2n 


Y xrm112 bi Q(x) yan) 


m=1 s=0 Q(x) , 
2n*1 n 
= Q,(x) 
xm 1/2 xin) 
X À Q(x) 
(b) Use Shanks’ identities to deduce Gauss’ triangular-number theorem: 
o o 1-x* 
Y xm 1/2 = TI are for |x| « 1. 
m=1 n=1 


6. The following identity is valid for Ix] <1: 


57 xmotiy2 [Ta + x"! = x"), 


m--o 


(a) Derive this from the identities 


in Exercises 2 and 5(b). 
(b) Derive this from Jacobi's tripl 


e product identity. 


7. Prove that the following identities, val 


id for |x| < 1, are consequences of Jacobi's 
triple product identity: 


(a) Tra - x*)-— (i= x5n-4) — p» (— 1)nxmGm* 32. 
n=1 


m--o 


(b) Ila = x*na = x50 "T ales | = y (= 1)nx"nSm* 12 
n=1 m--o 


8. Prove that the recursion formula 


npn) = Y opin — i 
k=1 


obtained in Section 14.10, can be put in the form 


np(n) = È mp(n — km). 


m-1ksn/m 


9. Suppose that each positive inte 
is a positive integer. Let P,{n) d 
Part k appears in at most g(k) 


326 


Ber k is written in g(k) different colors, where g(k) 
enote the number of partitions of n in which each 
different colors. When g(k) = 1 for all k this is the 


Exercises for Chapter 14 


h unrestricted partition function p(n). Find an infinite product which generates 
p(n) and prove that there is an arithmetical function f (depending on g) such that 


npjn) — Àj (k)p,(n — k). 


10. Refer to Section 14.10 for notation. By solving the first-order differential equation 
in (22) prove that if |x| < 1 we have 


Ta -xy = apf fat HO a}, 
neA 


where 


H(x) = X falk)x* and falk) = Z f(a. 
k=1 
deA 
Deduce that 
Ila = xy —e7* for |x| < 1, 


where u(n) is the Mobius function. 
The following exercises outline a proof of Ramanujan's partition identity 


J m x eer Sr T (x) = ( ) 

PS + 4)x 5 , where g(x; Tl 1 — x"), 
eT =1 

by a method of Kruyswijk not requiring the theory of modular functions. 


11. (a) Let e = e?*!* where k > 1 and show that for all x we have 


[la - xe) -1- ». 
h=1 


(b) More generally, if (n, k) = d prove that 
k 
Jla- xe") = (1 -—- xy, 
h=1 
and deduce that 


e Den 
[I - vem - [7 —xy ifk|n. 


h=1 


12. (a) Use Exercise 11(b) to prove that for prime q and |x| < 1 we have 


o 4 a (xt? 
II IIa - e TOE 


n=1h=1 


(b) Deduce the identity 
See oe ot 2i Iia - xre?=iNS), 


m=0 h=1 n=1 
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13. If q is prime and if 0 € r < q, a power series of the form 


n=0 


is said to be of type r mod q. 
(a) Use Euler’s pentagonal number theorem to show that (x) is a sum of three 
power series, 


a 


(x) = Td-=h+h+h, 


n= 


where I, denotes a power series of type k mod 5. 
(b) Let æ = e?"!5 and show that 


ll Ia = x"o") = To + Iya" + 1,0). 


h=1 n=] h-1 
(c) Use Exercise 12(b) to show that 
S (x?) 
Sm + 4)x5n** — y, —, 
LA inc * e(x*ys 
where V, is the power series of type 4 mod 5 obtained from the product in part (o). 


14. (a) Use Theorem 14.7 to show that the cube of Eul 


ler’s product is the sum of three 
power series, 


Ox) = Wo + W, + Wy, 


where W, denotes a power series of type k mod 5, 
(b) Use the identity W, + W, +W = (Ip +1, + I)? to show th: 
series in Exercise 13(a) satisfy the relation 


at the power 
TE = —1,2. 
(c) Prove that J; = —xo(x?5). 


15. Observe that the product Tk tot: la + I,a?) isa homogeneous polynomial 
in lo, 1,,1, of degree 4, so the terms contributing to series of type 4 mod 5 come from 
the terms 7,4, I5 1,71, and 1? "Ed 


(a) Use Exercise 14(c) to show that there exists a constant c such that 
V - ch*, 
where V, is the power series in Exercise 13(c), and deduce that 
E] x15) 
PL + 4)xmt4 = ext = 
5 and deduce Ramanujan’s identity 
(x8 
g(x) ` 


(b) Prove that c = 


EC 4x" = 5 
m=0 
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